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Who watches the watchmen? 


Some commercial firms that oversee the ethics and scrutiny of clinical trials have been found 
wanting. Human volunteers in research deserve better. 


regulations covering research on human volunteers. It includes a 

move to allow single institutional review boards (IRBs) to oversee 
US multi-site clinical trials. Such boards underpin the ethics and safety 
of a trial, reviewing all research protocols and consent forms before it 
is launched. Currently, dozens or scores of IRBs — one for each trial 
site — can separately review and approve a clinical trial and its neces- 
sary paperwork, leading to massive duplication of effort and delays. 

Commercial IRBs have multiplied in recent decades, reviewing and 
passing proposals for profit. Their creation was spurred partly by an 
explosion of industry-sponsored clinical trials, but they are increas- 
ingly passed business from academic centres, to relieve overstretched 
in-house ethics committees. There are now dozens of these companies 
and they will see potential business in the proposed shift to single-IRB 
oversight of multi-site trials. Commercial boards advertise themselves 
as faster at approvals than their academic counterparts. This promised 
speed and convenience could see organizers of multi-site trials turn to 
a commercial board rather than, for instance, the IRB at the principal 
investigator's institution. 

Why does this matter? Commercial IRBs, like their academic 
counterparts, are as expert and committed as the people who sit 
on them. But the for-profit groups have an unsettling incentive to 
approve trials. If they are too tough, applicants may take their next 
proposal and their fees elsewhere. (In the proposed changes to the 
regulations, the government acknowledges that such ‘IRB shopping’ 
is a risk and asks for public comment on how it could be prevented.) 

There is evidence that not all the commercial IRBs are as thorough 
as researchers, and their human subjects, would like. In the same week 


L= last month, the US government proposed an overhaul of 


that the government published the proposed rule changes, the Essex 
Institutional Review Board in Lebanon, New Jersey, was issued with a 
warning letter from the Food and Drug Administration (FDA). 

A common way to test IRBs is to submit fictitious applications con- 
taining problems that should raise alarm bells. According to the FDA, 
when Essex received one of these sting applications, reportedly from 
journalists, it approved the trial without putting appropriate warnings — 
including ofa serious heart risk and of acommon risk of graft rejection 
— in the informed consent, or even discussing the heart risk, according 
to minutes of its meeting. Nor did it spot that both the trial sponsor 
and the lead investigator were invented. The company, which has done 
work for the National Institutes of Health’s AIDS Clinical Trials Group, 
among others, received similar FDA warnings more than a decade ago. 

The FDA has also censured other IRBs. Two years ago, the Coast 
IRB in Colorado Springs, Colorado, disbanded after the Government 
Accountability Office (GAO) submitted a phony clinical-trial applica- 
tion that the IRB approved. Some commercial IRBs carry out first-rate, 
conscientious work. Members of two other for-profit IRBs dismissed 
the phony GAO application as “awful” and “junk”. One reviewer called 
it the “riskiest thing I’ve ever seen on this board”. 

And IRBs at academic institutions have their own ulterior motives 
to approve trials — among them institutional profile and prestige. And 
they, too, make mistakes. But the dollar remains the most powerful 
incentive to cut corners. The volunteers on whom clinical trials depend 
deserve the highest possible protection. Before implementing change 
that could see private companies overseeing an increased number 
of multi-site trials, the government should ensure that scrutiny of these 
IRBs is expanded to match. = 


NEON and on 


The launch of an ecological monitoring 
network is good news at a difficult time. 


to wire up the natural landscape to a state-of-the-art environ- 

mental monitoring system. The National Ecological Observa- 
tory Network (NEON) will take Mother Nature's pulse and track her 
vital signs during both bad times and good. 

In the coming years, NEON will pour massive quantities of bio- 
logical and physical data from plains, forests and lakes into computer 
spreadsheets to look for trends (see page 135). It is a bold initiative 
whose time has clearly come, but it will take more than the flip of the 
on switch to make NEON glow. 


A fter years of wrangling, scientists in the United States are set 


The project's elegance is appealing, but behind its beguiling simplic- 
ity lurk a number of potential pitfalls. Some are technical: it would be 
foolhardy to underestimate the challenge of collecting hundreds of 
data streams in a uniform manner across dozens of sites and over the 
course of decades. Others are social and institutional: ecologists have 
long gathered data independently; NEON represents a dramatic shift 
towards the kind of collective data-driven system that physicists and 
astronomers have been pursuing for decades. How will it be received 
on the ground? And what's more, how many scientists will be ready 
to use the data when they come in? 

The plan for NEON is to develop a healthy community of scientists 
who can make good use of the data. There are no guarantees, but the 
project is a risk worth taking. Human pressure on the biosphere will 
only escalate, and the sooner scientists get a firm handle on what that 
means the better. NEON could be a powerful tool for those investiga- 
tions. And, in an exceedingly difficult fiscal environment, the new 
ecological network represents a refreshingly forward-looking initia- 
tive. Ecologists (and headline writers) everywhere should welcome it. m 
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WORLD VIEW .ernssicossen 


with the release of New Worlds, New Horizons in Astronomy and 

Astrophysics, the latest decadal survey by the National Academy of 
Sciences. The survey offered a strategy — based on realistic budgets 
and leveraged by international and private partnerships — to realize 
dazzling opportunities, including searching for life on other planets, 
identifying dark matter and understanding dark energy. It also promised 
to reveal the evolution of the first stars and galaxies and to probe whether 
supermassive black holes are accurately described by general relativity. 

It began well, with the Large Synoptic Survey Telescope, the highest 
priority for ground-based astronomy, put on a path for construction 
in 2014. But today, US astronomy is in disarray, community morale is 
low and the proposed cancellation of the James Webb Space Telescope 
(JWST), the cornerstone of the 2010 decadal 
survey, is the most glaring example of difficul- 
ties ahead. Although capabilities and budgets 
in Europe and Asia have largely caught up with 
those of the United States, the health of the US 
programme remains critical, not just domesti- 
cally, but also to the rest of the world. 

As the opportunities have soared and projects 
have become more complex and costly, govern- 
ment spending on science has been shrinking. 
NASAs science budget is in steady decline. The 
House of Representatives has said that it wants to 
cut another 10% from NASA for 2012 — and that 
was before the bill to cut domestic spending over 
the next decade was passed earlier this month. 

This has undermined NASA’ planned collab- 
oration with the European Space Agency (ESA) 
on the International X-ray Observatory (IXO) 
and on a space-based interferometer to detect gravity waves — the 
Laser Interferometer Space Antenna (LISA). Both partnerships were 
endorsed by the decadal survey, but the Europeans are now looking at 
cheaper and less-ambitious missions that they can do alone. 

The decadal survey’s top priority was the Wide Field Infrared 
Survey Telescope (WFIRST), designed to find exoplanets and probe 
dark energy, but budget trouble has pushed its earliest possible launch 
into the mid-2020s. By then, much of its key science could have been 
done by a similar ESA mission, Euclid. (Euclid is competing for a slot 
in ESA’s Cosmic Vision programme, with an announcement expected 
this October.) 

The JWST illustrates the complexity of the problems in the US 
astrophysics programme. With a 6.5-metre deployable mirror and 
powerful infrared instruments, it will be a hun- 


I: is barely 12 months since US astronomy was shown the future, 


dred times more sensitive than the Hubble Space NATURE.COM 
Telescope and a worthy successor. It willimage _ Discuss this article 
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NASA IS GOING TO 
NEED A STEADY 
STREAM OF 


EXCITING 
SCIENCE RESULTS 
TO CAPTURE 
THE PUBLIC'S 


IMAGINATION. 


The dark clouds over 
US astronomy 


The proposed cancellation of NASA’s latest space telescope shows the 
difficulties ahead. But there is a solution, says Michael S. Turner. 


at least US$1.6 billion over budget, and last month the House voted 
to terminate it. 

The JWST can still be saved, and should be. The budget for 2012 is 
yet to be settled and there is a groundswell of support for the telescope: 
professional societies around the globe have issued statements of 
support, online petitions have been organized and influential commen- 
tators have called for its completion. But if the JWST does launch, its 
cost could yet wreak havoc on the rest of the astrophysics programme. 

I believe that there is a path forward for astronomy. It will involve 
hard choices and better international coordination, but the scientific 
payoffs ahead demand that we try. 

First, it is essential that the United States finishes the JWST. As well 
as being the cornerstone of the future astrophysics programme, the 
telescope will transform global astronomy just as 
Hubble has. But NASA must spread the additional 
cost across the agency so that it does not cripple 
the rest of US astrophysics. This will not be easy, 
but without a clear plan for the post-shuttle era, 
NASA is going to needa steady stream of exciting 
science results to capture the public's imagination 
and loosen the purse strings of Congress. 

Second, ESA and NASA will have another 
opportunity to merge their competing dark- 
energy missions, Euclid and WFIRST, if Euclid 
is selected for a Cosmic Vision slot. The agencies 
should do so, and next year they should seek a 
renewed partnership on IXO and LISA. 

Third, agencies on both sides of the Atlantic 
have their sights set on a giant ground-based 
telescope, 30 metres or more in aperture, with 
complex adaptive optics to complement the 
JWST. Both sides are seeking international and private partners. 
A coherent strategy to build at least two of these grand machines of 
discovery — one for the northern sky and one for the southern — 
is essential. In the United States, the National Science Foundation 
must clarify the currently muddled situation by picking which of the 
competing private projects it wants for a partner. And scientists around 
the world must work together to develop the necessary technology and 
instrumentation for these billion-dollar telescopes. 

We are in an unprecedented age of discovery, with opportunities 
to answer the biggest questions humankind can ask. Where did we 
come from? Where are we going? Are we alone? If the global science 
community can figure out how to live on leaner budgets by working 
together more effectively, this could yet be a golden age for astronomy. = 


Michael S. Turner is Rauner distinguished service professor and 
director of the Kavli Institute for Cosmological Physics at the University 
of Chicago, Illinois. 

e-mail: mturner@uchicago.edu 
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RESEARCH HIGHLIGHTS 


STATISTICS 


Ten thousand 
Kims of Korea 


Kim is the most common 
surname in Korea, and has 
probably held pole position 
for more than a millennium, 
according to a statistical study. 

Petter Minnhagen at Umea 
University in Sweden and his 
colleagues studied ten Korean 
family trees that traced lineages 
back through 500 years. 

Using a statistical analysis that 
divides the tree data into family 
groups and predicts how they 
mix, the team was able to 
extrapolate the frequency of 
the name Kim back to ap 500. 
The model predicts that at 
that time 10,000 inhabitants 
— about one-fifth of the total 
population — shared the 
surname. 

The results suggest that 
Korean culture has been stable 
for hundreds of years, the 
authors say. 

N.J. Phys. 13, 073036 (2011) 


ZOOLOGY 


Poisonous surprise 
for predators 


By chewing on the bark ofa 
poisonous tree, the African 
crested rat acquires a toxin that 
it delivers if bitten by predators 
— the first placental mammal 
found to combat predation in 
this way. 

When threatened, the rat 
(Lophiomys imhausi) parts its 
grey fur to expose a pattern of 


Wy 
pie 


Selections from the 
scientific literature 


Louisiana marsh restoration has failed 


For two decades, fresh water from the 
Mississippi River has been diverted into 
Louisiana's shrinking coastal wetlands 
(pictured) in an attempt to restore them, but 
three such projects have failed to increase 
vegetation or marsh area. Moreover, the 
diversions have raised the marshes’ risk of 
suffering hurricane damage. 

Michael Kearney at the University of 
Maryland in College Park and his team analysed 
satellite images collected between 1984 and 
2009 for three of the longest-running diversion 
projects. They found that most new plant growth 


specialized black and white 
hairs on its flanks (pictured), 
daring its attacker to bite the 
target. The creature picks up 
the toxin with its saliva by 
gnawing on the Acokanthera 
schimperi tree, then drools on 
the hairs. 
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enclosing many long fibrils 

—a structure well adapted 

to absorbing the toxin and to 
delivering it upon contact with 
a predator's mouth. 

Proc. R. Soc. B http://dx.doi.org/ 
10.1098/rspb.2011.1169 (2011) 


consisted of floating plants and algae rather than 

deep-rooted marsh plants that hold soil in place. 

For one of the diversions, Hurricane Katrina 

destroyed the most vegetation in zones that 

received the most direct input of fresh water. 
The authors say that an influx of nutrients 

in the river water, probably from agricultural 

run-off and industry, has been damaging marsh 

plants’ roots. 

Geophys. Res. Lett. http://dx.doi.org/10.1029/ 

2011g1047847 (2011) 

For a longer story on this research, see 

go.nature.com/qticwp 


molecule in the process. 

A brain region called the 
hypothalamus contains 
neurons that regulate feeding 
in response to nutritional 
signals. Rajat Singh and 
his colleagues at the Albert 
Einstein College of Medicine 


Fritz Vollrath at in the Bronx, New York, found 
the University of that these neurons cannibalize 
y Oxford, UK, and his H Uu ngry cel Is eat their own organelles, proteins 
colleagues examined from within and lipid stores — a process 
the structure of these known as autophagy — when 
hairs under two types When food is scarce, some starved of nutrients. This 
of electron microscope. brain cells begin to devour liberates fatty acids that cause 
They found the hairs themselves, activating the cells to express higher levels 
to be porous cylinders an appetite-stimulating of an appetite-stimulating 
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compound called agouti- 
related peptide. 

When the researchers 
deleted a gene required for 
autophagy in the mouse 
neurons that produce this 
peptide, the mice ate less 
and were slimmer than 
control animals with normal 
autophagy. 

Cell Metab. 14, 173-183 (2011) 


Biofilm for 
yeast sex 


Some microorganisms stick 
together to form impenetrable, 
drug-resistant biofilms that 
can contaminate surfaces such 
as those of medical devices. 
The yeast Candida albicans 

is one such microbe, but 
certain strains form another 
type of biofilm — one that 

is susceptible to drugs and 

is regulated by a different 
molecular pathway. 

David Soll and his 
colleagues at the University 
of Iowa in Iowa City found 
that about 10% of C. albicans 
strains form this second 
type of biofilm, which is also 
permeable to immune cells. 
These strains have a genetic 
difference at one locus that 
allows the yeast to reproduce 
sexually. The authors suggest 
that this minority type of 
biofilm aids in the mating 
process by mediating the 
movement of hormone 
signals. 

Comparing the different 
pathways that regulate the 
two types of biofilm could 
yield clues about what makes 
a biofilm drug-susceptible, 
which could aid in the 
development of treatments to 
disrupt harmful biofilms. 
PLOS Biol. 9,e1001117 (2011) 


Alzheimer’s 
inadish 


Skin cells taken from patients 
with Alzheimer’s disease 

and converted into neurons 
exhibit molecular defects 

that have been linked to the 
neurodegenerative condition. 


Such cells could function as 
models for those working to 
understand the disease. 

Asa Abeliovich at Columbia 
University in New York 
and his colleagues created 
neurons from the fibroblasts 
of three patients with genetic 
mutations that cause a rare 
hereditary form of Alzheimer’s 
disease. The authors used 
retroviruses to introduce 
key genes into the skin cells, 
directly reprogramming them 
into neurons. 

Compared with converted 
cells from healthy people, 
those from patients with 
disease-related mutations 
produced higher levels of 
a protein fragment called 
amyloid-f, which is linked 
to Alzheimer’s disease. All 
reprogrammed neurons 
contained amyloid precursor 
protein — which is clipped to 
make amyloid-$ — but in cells 
from people with Alzheimer’s 
mutations this protein was 
more localized in membrane- 
bound sacs. 

Cell 146, 359-371 (2011) 


Fabric that puts 
out fire 


A specially treated cotton 
fabric can actively extinguish 
flames when it is set on fire. 

The material was produced 
by Jaime Grunlan at Texas 
A&M University in College 
Station and his colleagues by 
dipping the fabric repeatedly 
into two different solutions 
of water-soluble polymers, 
creating a coating just 
hundreds of nanometres 
thick. When heated to high 
temperatures, this layered, 
‘intumescent film reacts to 
form a carbon-based foam 
that lowers the amount of 
gas fuelling the burn and 
slows heat transfer, inhibiting 
ignition. Flame testing 
showed that whereas the 
material burned when coated 
with just five protective layers 
(pictured, left panel), 20 
layers were sufficient for it to 
put out the flame soon after 
ignition (right panel). 

Such coatings may provide 
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COMMUNITY 


The most viewed 


CHOICE papers in science 


Hot cities heat up China 


Urbanization is responsible for roughly 
24% of the warming observed in eastern 
China during the past three decades, 
thanks to the build up of heat in cities — 
known as the urban heat island effect. 

Baode Chen at the Shanghai Typhoon Institute and 
his colleagues analysed monthly mean temperature data 
from 463 meteorological stations in a range of urban and 
rural settings between 1981 and 2007. Using both raw 
and reanalysed data, the team found that urbanization 
contributed up to 44% of the temperature increase (almost 
0.4°C per decade) in metropolises with populations of more 
than 1 million. The largest increase in the urban heat island 
effect came after the early 2000s, coinciding with a period of 
rapid growth of cities in China. 
J. Geophys. Res. http://dx.doi.org/10.1029/2010JD015452 (2011) 


C3 HIGHLY READ 
on www.agu.org 
the week of 1 Aug 


of Cambridge, UK, studied 
developing embryos and 
placentas taken from mothers 
that had gone 24 hours 
without food about mid- 
way through gestation — a 
stage in pregnancy that is 
crucial for the development 
of a brain region called 
the hypothalamus. The 
authors found shifts in 
gene expression, including 
upregulation in the 
placenta of genes involved 
in autophagy, a process by 
which cells degrade their own 
components. Meanwhile, 
in the hypothalamus the 
expression of genes linked 
to neural development was 
enhanced. 

The authors suggest that 
the placenta breaks down 
its own tissues, recycling 


an alternative to chlorinated 
or brominated flame- 


retardant chemicals, which 
persist in the environment 
if they leach from fabrics, 
and show signs of toxicity in 
animal studies. 

Adv. Mater. http://dx.doi. 
org/10.1002/adma201101871 


(2011) proteins inside its cells to 
provide a steady supply of 

nutrients to the developing 
hypothalamus. 

Placenta protects pec. atl Acad. Sci USA 

the b rain http://dx.doi.org/10.1073/ 
pnas.1106022108 (2011) 

Far from being simply a For a longer story on this 


passive supplier of food, the research, see go.nature.com/ 


mouse placenta protects the ej2kxh 

developing fetus from brain 

damage if the mother is > NATURE.COM 

deprived of food. For the latest research published by 
Kevin Broad and Barry Nature visit: 


Keverne at the University Www.lature.com/latestresearch 
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Science’s pay gap 
In the United States, the pay 
gap between men and women 
is slightly smaller in science, 
technology, engineering and 
mathematics (STEM) than 

in other fields, according to a 
report released on 3 August 
by the US Department of 
Commerce. Women’s average 
hourly earnings are 14% lower 
than men’s in STEM jobs, but 
21% lower in non-STEM jobs. 
When the data are adjusted 

to control for other factors, 
such as education, the STEM 
pay gap shrinks to 12%, with 
physical and life sciences 
coming in at 8%. Women 
comprise 24% of STEM 
workers, a proportion that 
has remained constant over 

a decade. See go.nature.com/ 
mshimp for more. 


To Jupiter! 

NASAs Juno spacecraft 
launched successfully on 

5 August, bound for Jupiter. 
The US$1.1-billion mission 
will take five years to reach 
the Solar System’s largest 
planet, which it will swing 
round 33 times ina highly 
elliptical orbit. Juno will 
probe the Jovian atmosphere 
for water vapour, and look 
for signs of a solid core (see 
Nature 476, 13-14; 2011). 


China’s sea claims 
Tensions increased between 
Vietnam and China over 
exploration in the South 
China Sea, after China’s state 
media reported on 2 August 
that the research vessel Tan 
Bao Hao had surveyed the 
sea’s southwest basin in late 
July, with the aim of studying 
tectonic activity. Vietnam’s 
foreign ministry demanded 
that China “cease and refrain” 
from further such acts. 

The two countries — and 
other neighbouring nations 
such as the Philippines — are 


Streaks suggest liquid water on Mars 


NASA scientists have reported evidence of salty 
water near the surface of Mars, in a region that 
may become a landing target for future missions 
to determine whether the planet can support 
life. Images from the High Resolution Imaging 
Science Experiment (HiRISE), a camera aboard 
the Mars Reconnaissance Orbiter, revealed 


in dispute over their claims 

to potentially oil-rich islands 
in the region. China's recent 
funding boost for deep- 

sea exploration is largely 
motivated by the nation’s 
desire to find oil and minerals 
(see Nature 466, 166; 2010). 


NSF gets social 
The fourth of the cross- 
disciplinary ‘synthesis’ centres 
set up by the US National 
Science Foundation (NSF) 
will be the first to involve 
social scientists. So far, 
synthesis centres founded 
in 1995, 2004 and 2008 
have focused, respectively, 
on ecology, evolutionary 
biology and the interface 
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between mathematics and 
biology. The National Socio- 
Environmental Synthesis 
Center, in Annapolis, 
Maryland, will research issues 
such as food production, water 
availability and the interaction 
between human activities and 
ecosystem health, the NSF 
says. It announced funding of 
US$27.5 million over 5 years 
for the centre on 2 August. 


Ecology network 
Researchers behind a long- 
planned US$434-million 
environmental monitoring 
network released the 
project’s draft scientific plan 
on 4 August. The National 
Ecological Observatory 
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thousands of narrow, dark streaks that appear 
on some of Mars’ steep slopes during warmer 
seasons (pictured, in Newton Crater). By winter, 
the streaks have faded or vanished. The findings 
were published on 5 August (A. S. McEwen 

et al. Science 333, 740-743; 2011). See go.nature. 
com/casgrb for more. 


Network, which aims to collect 
data for 20 ‘core’ observatories 
representing ecosystems 
across the United States, 

had cleared its final funding 
hurdle a week earlier, when the 
National Science Foundation 
awarded money to begin 
construction. See page 135 

for more. 


Stem-cell chief 
The US National Institutes 
of Health has finally found 
a director for its Intramural 
Center for Regenerative 
Medicine, a US$52-million, 
7-year initiative begun in 
2010. Mahendra Rao, a 
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stem-cell biologist who spent 
six years as vice-president 

of regenerative medicine at 
Life Technologies (formerly 
Invitrogen) in Carlsbad, 
California, will take up the 
position later this month. His 
appointment was announced 
on 3 August. See go.nature. 
com/urlckh for more. 


Ex-NIH head dies 


Bernadine Healy, who was the 
first woman to lead the US 
National Institutes of Health 
(NIH), died on 6 August 

aged 67. Healy, who trained as 
acardiologist, headed up the 
NIH between 1991 and 1993, 
taking over after the agency 
had been without a director for 
two years. Her greatest impact 
was in championing research 
into women’ health — she 
launched the Women’s Health 
Initiative, a US$625-million 
clinical trial which, in 2002, 
showed that hormone- 
replacement therapy increases 
the risk of breast cancer, stroke 
and heart disease. 


Nobel laureate dies 


Immunologist Baruj 
Benacerraf (pictured), who 
shared the 1980 Nobel Prize 
in Physiology or Medicine for 
his work on the genetic basis 
of the immune system, died on 
2 August, aged 90. Benacerraf 
was born in Venezuela and 
brought up in France, but it 
was at New York University 
in the 1950s and 1960s that 


TREND WATCH 


Japanese astronauts clocked their 
collective 494th day in space on 
24 July, passing Germans, with 
493, to make Japan the third 
most space-faring country. As 

of 10 August it had reached 

511 days. Japan has no hope of 
catching up with Russia's 20,817 
days or the United States’ 14,820, 
but International Space Station 
schedules show that Japanese 
astronauts will keep a firm grip 
on third place. China will become 
the next contender for high 
honours later this decade, as it 
establishes its own space station. 


he traced how guinea pigs’ 
varying immune responses to 
the same antigen depended on 
specific genes. From 1980 to 
1992, he was president of the 
Dana-Farber Cancer Institute 
in Boston, Massachusetts. 


Fukushima’s wake 


Japan will next year create a 
more independent nuclear- 
safety agency, scrapping its 
existing watchdogs — the 
Nuclear and Industrial Safety 
Agency (NISA) and the 
Nuclear Safety Commission. 
Both currently sit within 

the pro-nuclear trade and 
industry ministry, but the new 
watchdog would sever ties 
with that ministry and might 
impose tougher regulations 
on nuclear plants, said 

Goshi Hosono, the minister 
in charge of the country’s 
response to the Fukushima 
meltdowns. He announced 
the plan on 5 August; one 


A NEW ORDER IN SPACE 


day after the government 
had fired NISA’s director and 
two other industry ministry 
officials. 


Nuclear closure 


Only two commercial 
facilities in the world recycle 
plutonium and uranium from 
used nuclear fuel to make 
‘mixed oxide (MOX) fuel: 
now one of them is shutting 
down. Britain’s Nuclear 
Decommissioning Authority 
on 3 August announced the 
closure of the MOX facility 
at Sellafield. The plant was 
supposed to supply Japanese 
reactors with MOX, but has 
been left without a market in 
the aftermath of the nuclear 
meltdown at the Fukushima 
Daiichi plant. See page 140 
for more. 


Integrity plans 


The US Environmental 
Protection Agency’s draft 
policy on scientific integrity, 
released on 5 August, drew 
immediate criticism. The 
watchdog group Public 
Employees for Environmental 
Responsibility, headquartered 
in Washington DC, called it 
“by far the weakest scientific 
integrity rules of any agency”. 
Among its concerns are that 
managers and public-affairs 
staff can screen and approve 
the release of scientific 
information. The US National 
Science Foundation released 
a draft policy on 4 August. All 


Japanese astronauts have passed 500 days in space — putting the 
nation third in the space-faring rankings. 
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SEVEN DAYS | THIS WEEK | 


11-12 AUGUST 

AUS Institute of 
Medicine committee 
examining the need for 
chimpanzee research 
holds a public workshop 
in Washington DC. The 
group is expected to 
release a report by the 
end of the year. 
go.nature.com/obxi2t 


11-18 AUGUST 
Results from detectors 
looking at cosmic 

rays — including 

the International 

Space Station’s Alpha 
Magnetic Spectrometer 
— should be revealed at 
the 32nd International 
Cosmic Ray Conference 
in Beijing. 


http://icre2011.ihep.ac.cn 


12-14 AUGUST 
Expect updates on the 
attempt to build an 
elevator into space, 
from the 2011 Space 
Elevator Conference in 
Redmond, Washington. 
go.nature.com/krqd81 


agencies had to submit policies 
to the White House Office of 
Science and Technology Policy 
by 5 August. See go.nature. 
com/m2q2zg for more. 


Data mining OK 


An update to Britain’s 
copyright laws means that 
scientists will be allowed — for 
research purposes — to mine 
text and data from journal 
articles, databases and other 
literature without having to ask 
permission from the copyright 
owner. Such text digitization 
and data mining gets a specific 
exemption under government 
plans to modernize the 
country’s intellectual-property 
laws, announced on 3 August. 
See go.nature.com/hekfgw 

for more. 


> NATURE.COM 
For daily news updates see: 
Www.nature,com/news 
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Political pressure to balance the US federal budget will have an impact on research. 


Debt deal sets day 
of reckoning 


US science agencies avoid immediate pain but could be 
devastated by automatic cuts in 2013. 


BY ERIC HAND 


calpel or guillotine? Those are the 
possible fates in store for US science fund- 


ing after Congress and the White House 
reached a deal to cut federal spending and raise 
the nation’s self-imposed debt limit before a 
2 August deadline. 

The product of tumultuous negotiations, 
the deal largely spares science in the short term 
but puts a day of reckoning on the horizon: 
2 January 2013. If politicians cannot agree on 


how to improve the government's fiscal out- 
look by then through targeted cuts and other 
means — the scalpel option — their failure will 
automatically trigger the guillotine: a deep cut 
applied across a range of expenditures, includ- 
ing research. In the worst case, the automatic 
cuts could mean shuttered laboratories and 
mass lay-offs at universities. 

Whether or not that happens depends partly 
on a special Joint Select Committee on Defi- 
cit Reduction, the ‘super-committee; to be set 
up as part of the deal. The super-committee, 


which congressional leaders must appoint by 
16 August, will be charged with finding ways 
of raising revenues or reducing the costs of 
entitlement programmes such as health care 
and social security, which the government is 
legally required to fund. If the super-commit- 
tee fails, the automatic triggers would force 
discretionary spending to fall under the indis- 
criminate blade of across-the-board cuts. 

“Then there will be extraordinary pain,’ says 
Michael Lubell, director of public affairs for the 
American Physical Society in Washington DC. 
“And it will get worse in 2014” 

The two-stage structure of the debt deal 
explains both the short-term reprieve and 
the long-term worry. The first set of agreed 
cuts, totalling US$917 billion, will be spread 
over 10 years, but two factors mitigate their 
effect. First, reductions to defence spending 
will account for a significant share of the cuts 
— meaning that other US agencies won't bear 
the entire burden. Second, the cuts are heavily 
loaded forward onto the 2014 fiscal year and 
beyond, in an apparent effort to shelter the cur- 
rent fragile economy. Only minimal cuts will 
be implemented in fiscal years 2012 and 2013. 

In fact, the cap for overall discretionary 
spending that legislators will use in finishing 
the 2012 appropriations process is $29 billion 
higher than the cap that House Republicans 
have used for their budget proposals so far 
(see ‘Fiscal tug-of-war’). This means that the 
National Institutes of Health (NIH) might 
yet escape cuts to its funding that previously 
seemed likely. The higher cap also leaves wig- 
gle room for Senate legislators to restore pet 
projects, such as NASA’s James Webb Space 
Telescope, which the House last month voted 
to cancel. 

The second set of cuts, totalling at least 
$1.2 trillion, could inflict much deeper 
wounds. The task of selecting precisely how 
to make up that amount will fall to the super- 
committee, a group of six Democrats and six 
Republicans, who will have the power to look 
beyond discretionary funding to consider 
entitlement cuts and new revenue streams 
from taxes. The super-committee has until 
23 November this year to formulate a plan, 

which will be put before 
Congress a month later. 


For ongoing If the super-committee 
US budget cart reach agreement, 
coverage see: or if Congress won't 


approve its plan, the 
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FISCAL TUG-OF-WAR 


Significant gaps between President Obama’s 2012 budget request and amounts approved 


by the House appropriators have yet to be reconciled, and some key 


questions answered, but the 2012 budget is far less painful for US science than what may lie ahead. 


Agency 


ational Institutes of 
Health 


ASA 


ational Science 
Foundation 


Department of 
Energy Office of 
Science 


ational Oceanic 
and Atmospheric 
Administration 


ational Institute 
of Standards and 
Technology 


COUNTDOWN T0 CUTS 


House speaker John Boehner and 


2011 enacted 
(US$ million) 
30,766 
18,448 


6,807 


4,858 


4,597 


750 


President Barack Obama face 
18 months of budget wrangling 
during an election campaign. 


16 AUGUST 2011 


Congressional ‘super-committee’ 


selected. 


30 SEPTEMBER 2011 


End of 2011 fiscal year and current 
budget. Stop-gap legislation is likely until 


a 2012 budget is passed. 


23 NOVEMBER 2011 


Super-committee proposes how to 
reduce budget by US$1.2 trillion. 


23 DECEMBER 2011 


Congress votes on super-committee’s 


proposal. 


FEBRUARY 2012 


President Obama tables 2013 budget 


request. 


6 NOVEMBER 2012 


Presidential and congressional elections. 


2 JANUARY 2013 


If super-committee proposals not 
accepted, across-the-board cuts to all 


federal agencies begin. 


Will the proposed National Center for Advancing Translational 


Will the US Senate restore funding to the over-budget James Webb 


How low will grant-acceptance rates go? A short-term increase from 
2009 stimulus funding is coming to an end. 


Does the Advanced Research Projects Agency — Energy programme 
have a future? It barely survived in the House, after a floor amendment 


Will the administration get a National Climate Service, a proposed 
consolidation of various climate-research activities? 


> across-the-board $1.2-trillion cut will auto- 
matically kick in a year later (see ‘Countdown 
to cuts’). 

Such an outcome would be catastrophic, says 
Lubell. According to his back-of-the-envelope 
calculation, the automatic cut would slash fund- 
ing for science agencies by 11%, starting in 2013. 
For the Department of Energy’s $5-billion 
Office of Science, even shutting down a national 
laboratory — for example Fermilab, the particle- 
physics laboratory in Batavia, Illinois, with its 
$300-million annual budget — would achieve 
only part of the mandated savings. Granting 
agencies such as the NIH and the National Sci- 
ence Foundation (NSF) would have to lower 
their grant-acceptance rates to single digits. 
The knock-on effect would be catastrophic for 
universities, which depend on grants to help 
cover post doctoral researchers and infrastruc- 
ture costs. “Can universities make up that gap? 
Nota chance,” says Barry Toiv, spokesman for 
the Association of American Universities, based 
in Washington DC, an advocate for the largest 
research universities. 

The global cut would also wreak havoc with 
the 2013 appropriations process, which would 
begin with President Barack Obama's budget 
request in February 2012. In sucha scenario, 
appropriators could play favourites among 
different science agencies, boosting some at 
the expense of others in anticipation of the 
looming cuts, says Lubell. 

Science agencies pursuing basic research 
that both parties tend to support, such as the 
NIH, the NSF and the core programme of the 
National Institute of Standards and Technology 
— could fare well. Basic research at the Depart- 
ment of Defense might also survive, as it offers 
little in the way of potential savings, being such 
a small piece of the overall defence pie. 

But ‘mission agencies, such as NASA, or the 
National Oceanic and Atmospheric Adminis- 
tration (NOAA), where science is a large but 
ultimately ancillary activity, could suffer. Worst 
off would be programmes supporting research 
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Will the Technology Innovation Program survive? The House would 
eliminate the $75-million industry partner programme, which didn’t 


2012 president’s 2012 House spending | Outstanding issues 
request (US$ million) bill (US$ million) 
31,829 Not yet put forward 
Sciences be approved by 1 October 2012? 
18,724 16,810 
Space Telescope? 
7,/68 6,860 
5,416 4,800 
was accepted by one vote. 
5,498 4,485 
1,004 701 
offer new grants in 2011. 


that is controversial in the current Congress, 
says Patrick Clemins, director of the research 
and development budget programme at the 
American Association for the Advancement 
of Science in Washington DC. This could spell 
trouble for climate-change research at NOAA 
and within NASAss Earth Science Division. The 
Department of Energy’s applied energy and 
loan-guarantee programmes could also be vul- 
nerable, as some members of Congress see them 
as picking winners in a part of the economy that 
should be governed by the free market. 

This is why science advocates are hoping 
that the make-up of the super-committee will 
favour the kind of negotiation that would head 
off across-the-board cuts. Toiv would like to 
see members selected from the Bowles—-Simp- 
son commission, which was appointed by 
Obama to study options for reducing the 
national debt. It emphasized infrastructure, 
education and basic research as important to 
the nation’s long-term economic health. But 
Lubell expects the super-committee to be 
entrenched along partisan lines: entitlement 
reform would be off-limits for the Democrats, 
whereas new taxes would be anathema for 
Republican members. Refusing to compromise 
would enable both parties to pander to their 
traditional constituencies in the November 
2012 general elections. 

April Burke, president of Lewis—Burke Asso- 
ciates in Washington DC, a science-lobbying 
firm, is more sanguine, saying that an impasse 
would not necessarily trigger doomsday cuts. 
Congress, a slippery beast, has devised ways 
to avoid automatic triggers before. And the 
2 January 2013 implementation of the man- 
dated cuts is a political lifetime away. By then, 
the 2012 election could have changed the politi- 
cal stage, along with its players. The economy 
could improve — or get worse. And new rules 
could be written. 

“I dont think we're looking at the full menu 
of what might happen then,” she says. “I think 
there might be yet another reality.’ = 


SOURCE: HOUSE APPROPRIATIONS COMMITTEE 


AP/P. MARTINEZ MONSIVAIS 


GROUND COVER 


Permanent research stations (red circles) are broadly 
representative of each of NEON’s 20 eco-regions. 
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US launches 
eco-network 


Ambitious project to systematically monitor the environment 
on acontinental scale is finally ready to break ground. 


BY JEFF TOLLEFSON 


eady or not, the era of big data is 
Renee to ecology. After years of dis- 

cussion and debate, the United States is 
moving forward with an environmental moni- 
toring network that promises to help transform 
a traditionally small-scale, local science into a 
continental-scale group enterprise. 

The National Ecological Observatory 
Network (NEON) will consist of 20 ‘core’ 
observatories representing distinct eco-regions 
throughout the United States (see map). These 
will be bolstered by temporary stations that 
can be relocated wherever data need to be col- 
lected. The sites will house equipment and host 
visiting researchers, while gathering a range of 
environmental data over at least three decades. 

The result will be a vast database that scien- 
tists can mine to tackle broad questions such 
as how global warming, pollution and land- 
use change are affecting ecosystems across 
the country. “NEON is really about trying 
to understand the biology of the entire con- 
tinent rather than the biology of a specific 
place,” says David Schimel, the project’s chief 


science officer, based in Boulder, Colorado. 
Conceived more than a decade ago, NEON 
has already spent just over US$80 million plan- 
ning the network and developing instruments, 
and hasa staff of about 140, including some 60 
scientists and engineers. But the project, which 
is run by an independent body — NEON, Inc. 
— didn’t clear its final hurdle until 28 July, 
when the National Science Foundation (NSF) 
awarded it $434 mil- 
lion over the next 


© 7 
decade, with $12.5 hein Sere 
million to be spent in 
the current fiscal year. F pe asm f 
The money could ys - d ° 
jump-start site eed ica 
preparationandcon- S29U1S tobea 


huge challenge.” 


struction as early as 
this year. Schimel 
says that NEON expects to begin work near 
its home base in Colorado and in the north- 
east and to expand from there. The first data, 
from sensors placed on towers, submerged 
in streams and buried in soils, should come 
next year — a trickle that project mem- 
bers hope will become a torrent by 2016, 
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when the project will be fully operational. 

“It’s a huge step forward,” says Sandy 
Andelman, an ecologist who heads a network 
of tropical ecology sites through Conservation 
International in Arlington, Virginia. But, she 
cautions, NEON will create a massive new data 
set that few environmental scientists know 
how to use today. “Being able to manage and 
process and make sense of those data is going 
to bea huge challenge,’ Andelman says. 

Once the entire network is up and running, 
some 15,000 sensors will work in concert with 
scientists on the ground to supply roughly 500 
distinct categories of data ranging from basic 
weather readings to concentrations of ozone 
in the air and nitrogen in the soils, leaves and 
streams. Scientists will collect tens of thou- 
sands of samples, including soil, water, plants 
and small mammals. At the same time, aerial 
surveys will analyse broader land-use trends 
as well as details such as leaf chemistry and 
carbon stocks, and satellite data will expand 
coverage over the entire continent. 

NEON’s draft scientific plan, released on 
4 August, identifies climate, land use and inva- 
sive species as drivers of ecosystem change 
that can be studied through their impact on 
biodiversity, biogeochemical and hydrologi- 
cal cycles and the spread of infectious diseases. 
By gathering data over large territories and 
long periods of time, NEON will aim to give sci- 
entists the statistical power they need to tease 
out subtle trends in a shifting ecological balance. 
For this reason, rather than focusing on regional 
investigations, the research stations are mainly 
designed to gather data in a uniform manner 
that can be used in larger network studies. 

“The big intellectual hurdle was the transi- 
tion from a series of largely regional sites to 
an integrated system,” says James Collins, an 
ecologist at Arizona State University in Tempe, 
who as head of the biological sciences direc- 
torate at the NSF during 2005-09 helped to 
mould the proposal into its current form. 

Over the years the project has battled 
against scepticism. Early on, Steven Wofsy, an 
atmospheric chemist at Harvard University in 
Cambridge, Massachusetts, and a pioneer of 
carbon studies at the Harvard Forest ecologi- 
cal research centre, feared that NEON would 
generate more data than value. Today he gives 
NEON credit for putting high-profile critics — 
including himself — on its scientific advisory 
board to address any problems head on. 

Wofsy remains sceptical of big science pro- 
jects that promise to open up a new field with 
lots of data that scientists haven't necessarily 
said they want. But he commends NEON for 
developing a plan to tackle scientific questions 
that will play out over decades. 

“If these guys are successful, they will 
have the goods on some really big ecological 
questions,’ Wofsy says. “Scientists who are 
interested in addressing those problems will 
have to make the effort to learn how to use 
those data.” m SEE EDITORIALP.I25 
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Ancient DNA reveals 
secrets of human history 


Modern humans may have picked up key genes from extinct relatives. 


BY EWEN CALLAWAY 


lain undisturbed for tens of thousands of 

years, ancient human genomics is mov- 
ing at breakneck speed. Barely a year after the 
publication of the genomes of Neanderthals' 
and of an extinct human population from 
Siberia’, scientists are racing to apply the 
work to answer questions about human 
evolution and history that would have been 
unfathomable just a few years ago. 

The past months have seen a swathe of 
discoveries, from details about when Neander- 
thals and humans interbred, to the important 
disease-fighting genes that humans now have 
as a result of those trysts. 

Neanderthals were large-bodied hunter- 
gatherers, named after the German valley where 
their bones were first discovered, who roamed 
Europe and parts of Asia from 400,000 years ago 
until about 30,000 years ago. The Neanderthal 
genome — shepherded by Svante Paabo at the 
Max Planck Institute for Evolutionary Anthro- 
pology in Leipzig, Germany — indicates that 
their evolutionary story began to split from 
the lineage of modern humans less than halfa 
million years ago, when their common ances- 
tor lived in Africa (see ‘The human straim). 
In December last year, Paabo’s team released 
the genetic blueprint of another population 
of ancient humans — unlike ourselves or the 
Neanderthals — that was based on DNA recov- 
ered from a 30,000-50,000-year-old finger bone 
found in a cave in Denisova in southern Siberia’. 
Palaeoanthropologists call these groups archaic 
humans, distinguishing them from modern 
Homo sapiens, which emerged in Africa only 
around 200,000 years ago. 

Paabo is amazed at how quickly the Nean- 
derthal genome has been mined. At a genomics 
meeting last year, for example, Cory McLean, 
a graduate student at Stanford University in 
California, was scheduled to talk immediately 
after Paabo presented the Neanderthal genome. 
Inspired, McLean had trawled through the just- 
released genome in the days before his talk. He 
discovered that Neanderthals, like humans, 
lacked a stretch of DNA that orchestrates the 
growth of spines on the penises of other pri- 
mates, and promptly presented the find just 
after Paabo presented his’. 

Since then, scientists have fleshed out the 


FE ora field that relies on fossils that have 


THE HUMAN STRAIN 


As Homo sapiens evolved and 
migrated across the world, they 
apparently interbred with archaic 
humans such as Neanderthals 


and Denisovans. 
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details of one of the biggest surprises from 
the Neanderthal genome: humans living 
outside Africa owe up to 4% of their DNA to 
Neanderthals. One explanation might be that 
humans migrating out of Africa mated with 
Neanderthals, probably resident in the Middle 
East, before their offspring fanned out across 
Europe and Asia. 

By comparing individual DNA letters in 
multiple modern human genomes with those 
in the Neanderthal genome, the date of that 
interbreeding has now been pinned down 
to 65,000-90,000 years ago. Montgomery 
Slatkin and Anna-Sapfo Malaspinas, theoreti- 
cal geneticists from the University of California, 
Berkeley, presented the finding at the Society 
for Molecular Biology and Evolution meeting 
in Kyoto, Japan, held on 26-30 July. 

Slatkin says that their result agrees with 
another study presented at the meeting that 
came from the group of David Reich, a geneti- 
cist at Harvard Medical School in Boston, Mas- 
sachusetts, who was involved in sequencing 
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i) Key dates of Homo 
sapiens presence 


MELANESIA 


both the Neanderthal and Denisova genomes. 
The dates also mesh with archaeological finds 
bookending early human migrations out of 
Africa to between about 50,000 and 100,000 
years ago. Reich's team is now developing tools 
to find signs of more recent interbreeding that 
might have occurred after humans arrived in 
Asia and Europe. 


MORE THAN GENES 

The denizens of Denisova also bred with 
contemporary humans, according to Paabo 
and Reich's analysis”. But the only traces of 
their DNA to be found in modern humans 
were in residents of Melanesia, thousands of 
miles away from Denisova, suggesting that 
the Denisovans had once lived across Asia. In 
2008, Paabo’s team set up a lab in Beijing to 
screen fossils that might contain Denisovan 
DNA, in the hope of learning more about them 
and their interactions with modern humans. 
Currently, the bone that yielded the Denisovan 
genome, and a single molar from the same 


MAX-PLANCK-INST. EVOL. ANTHROPOL. 


cave, are their only known fossil remains, but 
other archaic human fossils from Asia could 
bear traces of this group. 

Even before the Neanderthal genome made 
its debut in May 2010, scientists had argued that 
humans may have acquired not just DNA from 
archaic humans, but useful traits too. Human 
gene variants linked to brain development and 
speech were proposed as candidates, only to be 
scotched after closer inspection of the Nean- 
derthal genome. However, a study presented at 
a Royal Society symposium in London in June 
suggests that humans owe important disease- 
fighting genes to Neanderthals and Denisovans. 
Interbreeding endowed humans with a ‘hybrid 
vigour’ that helped them colonize the world, 
said Peter Parham, an immunogeneticist at 
Stanford University School of Medicine, Cali- 
fornia, at the symposium. 

Parham’s team compared a group of diverse 
immune genes — the human leukocyte antigen 
(HLA) genes — in Neanderthals, Denisovans 
and human groups from around the world. In 
several cases, Neanderthals and Denisovans 
carried versions of HLA genes that are abundant 
in modern humans in parts of Europe and Asia, 
but less common in Africans. Varying degrees 
of interbreeding could explain the mismatch, 
Parham says. He estimates that Europeans owe 
50% of variants of one class of HLA gene to 
interbreeding, Asians 70-80%, and Papua New 
Guineans up to 95%. 

“Tt does mean that some of us owe part of our 
immune-system function to Neanderthals,” 
says Paabo. However, John Hawks, a biological 
anthropologist at the University of Wisconsin- 
Madison, notes that many HLA genes pre-date 
humans’ split from Neanderthals and Denis- 
ovans, and that the differences may have arisen 
by chance as the groups evolved. 

Hawks, too, has been digging into the archaic 
genomes, and his team has already discovered 
that Neanderthals and Denisovans lack certain 


Most of the Neanderthal genome was sequenced 
from bones found in Vindija cave, Croatia. 
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forms of genes that may help modern humans to 
fend off epidemic diseases, such as measles. This 
is hardly surprising: the low population den- 
sity of hunter-gatherers meant that epidemics 
were unlikely, so they probably would not have 
benefited from these immune genes. 

But Hawks’s team is now using the find to 
test whether the defensive genes are linked to 
autoimmune diseases. In September, Hawks 
and his colleague Aaron Sams are scheduled 
to present data at a meeting of the European 
Society for the Study of Human Evolution in 
Leipzig, Germany, showing that the Denis- 

ovans lacked nearly 


“These genomes all of the gene vari- 
are publicly ants linked to coeliac 
available. disease, a gut auto- 
There’snothing immune disorder 
stopping high- present in modern 
school students humans. Hawks sus- 
from doing pects that the variants 
this.” may actually be in the 


same genes that are 
linked to epidemic resistance — if they are, 
further study could reveal how recently such 
autoimmune diseases arose in humans. 

Unlike most scientists mining the ancient 
genomes, Hawks has reported some of his 
more prosaic findings — Denisovans didn't 
have red hair, for example — on his blog (see 
go.nature.com/irclra). “These genomes are 
publicly available. There’s nothing stopping 
high-school students from doing this, and 
the kind of stuff that ’'m putting out on my 
blog is the stuff that a smart high-school stu- 
dent could do.” More significant (and closely 
guarded) insights will come from developing 
new methods for analysing ancient genomes to 
test hypotheses about evolution, he says. 

Paabo, Reich and the other scientists involved 
in sequencing the ancient genomes are eager 
to see others run with their data, but caution 
that they need to be aware of the limitations. 
“They're really terrible-quality genomes’, 
chock-full of gaps and errors and sections in 
which short stretches of DNA sequence have 
been put in the wrong place, says Reich. “There 
are a lot of traps in using these data, and if people 
are not careful they'll find all sorts of interesting 
things that are wrong.” Paabos team is working 
on improving the quality of the sequences and 
including data from more Neanderthals and — 
he hopes — Denisovans. 

Paabo says that he and his team regularly 
receive e-mails from scientists asking them 
questions about using the ancient genomes, 
which they have attempted to make as user- 
friendly as possible. But if the first year of 
ancient human genomics is any indication, 
these requests will multiply as scientists find 
new applications for the genomes. “Maybe 
we should write a little booklet called archaic 
genomics for dummies, Paabo says. = 
1. Green, R. E. et al. Science 328, 710-722 (2010). 


2. Reich, D. et al. Nature 468, 1053-1060 (2010). 
3. McLean, C. Y. et al. Nature 471, 216-219 (2011). 
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Chinese academies promise 
cleaner elections 


Researchers call for further reforms in selection process for prestigious membership. 


BY JANE QIU IN BEIJING 


eing elected as an academician in China 
B can be a passport to political influence 

and financial reward. Members of the 
Chinese Academy of Sciences (CAS) and the 
Chinese Academy of Engineering (CAE) are 
supposed to represent the cream of the coun- 
try’s research, and are often invited to sit on 
powerful government committees. In 
some provinces they are offered the same 
official ranking as a vice-minister, and are 
favoured with prize money, cars, houses 
and personal doctors. The pursuit of these 
prestigious posts can involve intensive lob- 
bying and even bribery. 

This year’s academy elections are already 
in full swing, and both societies are prom- 
ising to clean up the process through 
transparent and rigorous oversight. They 
are publishing their shortlists and inviting 
public comment at every stage of selection, 
and saying that they will take allegations of 
misconduct more seriously than ever before. 
This new openness is throwing prospective 
academicians under increasing scrutiny. 

Witness the case of Duan Zhenhao, a 
geologist at the CAS’s Institute of Geology 
and Geophysics (IGG) in Beijing. He was one 
of 314 candidates nominated to the CAS this 
year; two rounds of voting will whittle that list 
down to a maximum of 60. But, last month, 
after Duan was accused on an Internet forum 
of embezzling research funds, the CAS began 
an unusually swift investigation. Duan was 
detained by police. Both the CAS and the IGG 
declined to comment on the matter, and Duan 
did not respond to interview requests. 

Fang Shimin, a US-trained biochemist 
who runs a website (www.xys.org) exposing 
research misconduct in China, welcomes the 


> 


MORE 
ONLINE 


TOP STORY 


CAS’s quick response to the allegations, but 
says that investigations of alleged misconduct 
by nominees still lack rigour and transparency. 

Others say that the academy has made 
visible headway. “The CAS has significantly 
tightened up the election process in recent 
years,” says Mu-ming Poo, a neuroscientist 
at the University of California, Berkeley, and 
director of the CAS Institute of Neuroscience 


New academicians join a world of prestige and power. 


in Shanghai. “I can see obvious improvement 
in the quality of newly elected academicians.” 
(As a US citizen, Poo is not eligible for CAS 
membership.) Yao Tandong, director of the 
CAS Institute of Tibetan Research in Beijing, 
who was elected to the academy in 2007, 
agrees that the evaluation procedure for CAS 
academicians is now “probably one of the 
fairest systems in science in China’. 
Meanwhile, Xu Rigan, vice-president of the 
CAE, last month told the Xinhua news agency 
that his academy plans to improve its own elec- 
tion process to ensure that only those who have 
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made significant contributions to engineering 
or clinical science become academicians. The 
academy has been criticized for electing a high 
proportion of government officials and cor- 
porate executives, and turning a blind eye to 
attempts to exploit the system. Nominations 
can come from companies and government 
agencies, as well as from institutions, academic 
associations and the academicians themselves. 
These non-academic groups often run major, 
well-financed election campaigns, but the 
CAE is clamping down on such methods. In 
his statement to Xinhua, Xu said “those who 
cheat or try to use inappropriate means to 
get elected will lose their candidacy this year”. 

When the CAE announced in June that 
it had selected 163 candidates from a list of 
485 nominees, several nominated govern- 
ment officials and company executives had 
apparently failed to make the cut — among 
them the deputy mayor of Shanghai and the 
chairman of state oil giant Sinopec. Rejected 
candidates had not demonstrated adequate 
academic contributions, says Xu. 

But Gu Haibin, an economist at the 
Renmin University of China in Beijing, 
who has studied the system for ten years, 
believes that the academies’ efforts don't go 
far enough. “Promises to clean up the elections 
mask a deeper institutional problem,” he says. 

Gu argues that the current academician 
system reinforces guan ben wei, the belief that 
political power and status are more important 
than any other achievement. This means that 
many academics act as if the ultimate goal of 
their academic endeavours is political advance- 
ment, rather than producing quality research. 

“The whole system is rotten to the core,” 
says Gu. “Guan ben wei is deeply rooted in 
Chinese society and is poisoning all aspects 
of academic life.” m 
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Plutonium plans in limbo 


Fukushima’s economic fallout shutters Britain’s Sellafield mixed-oxide facility. 


BY EDWIN CARTLIDGE 


he Fukushima nuclear disaster, already 
"Tes a long shadow over the nuclear 
industry, has claimed another victim. 

Last week, Britain’s Nuclear Decommiss- 
ioning Authority (NDA) announced that it 
will close a troubled Sellafield facility that is 
one of only two commercial plants in the world 
producing mixed oxide nuclear fuel (MOX), 
after reactor shutdowns in Japan eliminated its 
only customers for the plutonium-containing 
fuel. “The reason for this [closure] is directly 
related to the tragic events in Japan and their 
ongoing impact on the power markets,” says 
Tony Fountain, the NDAs chief executive. The 
decision is a setback for the British nuclear 
industry and leaves the country wondering 
anew what to do with the world’s largest single 
stockpile of plutonium. 

MOX is created by combining uranium with 
plutonium extracted from spent reactor fuel or 
recycled from unused nuclear weapons. Pro- 
ponents say that it provides a way to squeeze 
more energy out of uranium, reduce the vol- 
ume of reactor waste, and destroy material that 
could otherwise be turned back into weapons. 
MOX accounts for 2% of the fuel burned in 
reactors around the world, 80% of it in France. 

Opened in 2001, the Sellafield MOX plant 
was originally designed to manufacture 560 
tonnes of MOX over 10 years. It would have 
allowed Britain to convert much of its plu- 
tonium stockpile, which now stands at 112 
tonnes. This was built up over years of repro- 
cessing spent fuel to extract plutonium, which 
was originally intended for use in fast-breeder 
reactors. 

But technical failures have repeatedly halted 
MOX production at Sellafield. The plant has 
produced just 15 tonnes of MOX, at a total 
cost of £1.4 billion (US$2.3 billion). Its closure 
leaves the Melox plant in Gard, France, oper- 
ated by energy company Areva, as the world’s 
only commercial supplier — it has produced 
about 1,500 tonnes of the fuel since 1995. 

Last year, things were looking up for the 
Sellafield MOX plant. The NDA, which has 
owned the plant since 
2005, reached an agree- 
ment with 10 Japanese 
electrical utilities that 
they would buy MOX 
fuel produced from Japa- 
nese plutonium stored at 
Sellafield. Chubu Electric 
Power in Nagoya was set 
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The end of MOX production at Sellafield puts Britain’s huge plutonium stockpile in the spotlight. 


to pay for upgrades at Sellafield’s plant and buy 
the first consignment of fuel for its Hamaoka 
power station, and TEPCO, the Tokyo-based 
utility company that operated the shattered 
Fukushima Daiichi reactors, would have taken 
half of all the Japanese fuel produced. 

But Sellafield has now lost its customers. 
In the wake of the nuclear disaster, only 17 
of Japan’s 54 nuclear reactors are currently 
operational, TEPCO is under serious financial 
and political pressure, and Hamaoka remains 
closed while tsunami protection is improved 
at the site. 

As well as leading to the loss of roughly 600 
jobs, the closure will intensify the debate over 
what to do with Britain’s plutonium stock- 
pile. Simply burying the plutonium deep 
underground would write off the energy that 
it could generate, and waste an undisclosed 
sum of money that has already been spent 
extracting it from spent fuel. But keeping 
it in indefinite storage is not an option, says 
Francis Livens, research director of the Dal- 
ton Nuclear Institute at the University of Man- 
chester. Over time, plutonium-241 decays to 
americium-241, which gives off fairly intense 
gamma radiation that would make handling 
the stockpile much more difficult. 

Consequently, the British government is con- 
sidering building a brand new MOX plant with 
a much higher output, with an announcement 
expected in the autumn. A report produced 
earlier this year by the University of Oxford's 
Smith School of Enterprise and the Environ- 
ment favoured renewed MOX production but 
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estimated the cost of a new plant at £2 billion 
to £3 billion. The government would be “crazy 
to consider building another MOX plant’; says 
energy-policy analyst Steve Thomas of the Uni- 
versity of Greenwich in London. MOX cannot 
be recycled indefinitely, so “all that reprocessing 
does is put off the day when we have to dispose 
of the waste, leaving the problem with people 
two generations from now”. 

A spokesman for Areva told Nature that 
the company does not expect the Fukushima 
accident to significantly reduce demand for 
MOX in the long term. He notes that Areva 
is currently building the United States’ first 
MOX plant, to recycle unwanted plutonium 
from nuclear weapons, at the Savannah River 
Site in South Carolina, and that China is also 
considering its own facility. “This says to us 
that MOX has a future,” he adds. 

But the Savannah River facility is hugely 
over budget, and will produce fuel that will 
cost about five times as much as conventional 
nuclear fuel, according to Frank von Hippel of 
Princeton University, New Jersey, an expert on 
energy policy and nuclear arms control. And 
although France is a stronghold of MOX pro- 
duction for now, he adds, its future depends 
on continued support for nuclear reprocessing 
from the French government. If the opposition 
Socialist Party wins next year’s presidential 
election, Areva’s position could be weakened, 
von Hippel argues. “Given the dominance of 
France in this field,” he says, “that would put 
a question mark over the future of MOX just 
about everywhere.” m 
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WHY 
FAKE IT? 


HOW ‘SHAM’ BRAIN SURGERY COULD BE 
KILLING OFF VALUABLE THERAPIES FOR 
PARKINSON’S DISEASE. 


BY ALLA KATSNELSON 
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eggy Willocks was 44 when she was diagnosed with Parkinson’s 

disease. It progressed quickly, forcing her to retire four years 

later from her job asa primary-school principal in Elizabethton, 

Tennessee. Soon, her condition had deteriorated so much that 
she was often unable to dress and feed herself, take care of basic hygiene 
or walk unaided across a room. 

Willocks enrolled in a trial for an experimental therapy called 
Spheramine, developed by Titan Pharmaceuticals, a biotechnology com- 
pany in South San Francisco, California. Spheramine consists of cultured 
human retinal epithelial cells bound to specialized man-made carrier 
molecules. The cells are implanted into the brain, where it is hoped that 
they will produce the dopamine precursor levodopa, which can reduce 
the symptoms of Parkinson's disease. In August 2000, Willocks became 
the second person ever to receive the treatment. After havinga steel halo 
— astereotactic frame — bolted to her skull, she was put under general 
anaesthesia. Surgeons then used the frame and coordinates obtained 
from numerous magnetic resonance imaging (MRI) scans to pinpoint 
the location at which to drill. They then snaked a catheter through her 
brain's white matter to deliver the cells into the striatum. 

At first there was no effect, but Willocks says that after 6-8 months 
she began to feel better. The changes were always moderate and grad- 
ual, except for once, about nine months after her surgery, when she 
showed what her doctor called a “radical” improvement in balance. By 
a year after the treatment, she and the five 
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a sort of historical precedent” for using placebo controls, but it may 
not apply to neurosurgical trials, he says. Willocks and other patients 
go further. Placebo-controlled studies aren't just unnecessary, they say, 
they are actually causing the downfall of potentially valuable treatments. 


ACOMPLICATING CONTROL 
During the past 25 years, surgical therapies for Parkinson's disease have 
travelled a rocky road. In 1987, a report by Mexican surgeons’ described 
seemingly miraculous effects in two patients with severe Parkinson's 
who received transplants of tissue from the adrenal gland, which pro- 
duces dopamine. In the next several years, hundreds of patients received 
the treatment, but some autopsies later showed that the cells didn't in 
fact survive*. Around the same time, researchers started to test fetal 
nerve-cell transplants (similar to those in Barker’s trial) in small-scale 
studies, finding mixed but promising results. Two studies*® comparing 
the treatment to sham surgery concluded, however, that the transplants 
were not only ineffective, but also often caused dyskinesia — the move- 
ment disorder that plagues people with Parkinson’s disease. In the past 
seven years, three experimental treatments (including Spheramine) 
that showed promise in small, open-label studies’”* failed in phase II 
trials**” comparing them with a sham control (see “The sham wall). 
Sham brain surgery is no sugar pill. After the stereotactic frame is 
affixed to the skull, the patient is usually anaesthetized and surgeons drill 
into the skull. In most cases, the burr holes 


other patients in the phase I trial showed an 
improvement in motor ability of 48%, and 
those gains largely held 4 years later’. 

Ten years on, she says she notices her 
condition worsening, but is still doing much 
better than she was before her operation. She 
has no doubt that the treatment works. Inves- 
tigators disagree: Spheramine was shelved in 
2008 after a follow-up phase II, double-blind 
study found that it was no more effective 
than placebo’. This time, the researchers 
compared the treatment with a ‘shany brain 


THERE’S A HISTORICAL 
PRECEDENT FOR USING 
PLACEBO CONTROLS, 
BUT IT MAY NOT APPLY 
TO NEUROSURGICAL 
TRIALS. 


stop at the dura mater, a protective mem- 
brane covering the brain, but they sometimes 
go deeper: ina phase II trial testing the nerve 
growth factor GDNE investigators catheter- 
ized the brain in all participants but infused 
saline, rather than GDNE into the controls’. 

“We have to stage the whole thing such that 
from the outside it’s completely indistinguish- 
able” from the real thing, says Joao Siffert, 
chief medical officer of Ceregene, a company 
in San Diego, California, that is working ona 
therapy that delivers a gene for another nerve 


surgery that copied almost every aspect of the 
procedure Willocks received, short of injecting cells into the brain. 

For many investigators aiming to treat Parkinson’s and other neuro- 
logical diseases invasively, using sham brain surgery as a control is, well, 
a no-brainer. And the practice is likely to expand in coming years, as 
researchers continue to develop experimental tissue transplants, gene 
therapies and stem-cell treatments. Small safety trials such as the one in 
which Willocks was enrolled may hint at the efficacy ofa treatment, but 
they are not designed to prove it. And because they are ‘open label’ — 
both the investigators and the participants know that the drug is being 
administered — they are riddled with biases that can skew results. “It is 
so clear that open-label studies provide information that is not reliable,” 
says Warren Olanow, a neurologist at New York’s Mount Sinai Medical 
Center who has worked on cell-based neurosurgical therapies in Par- 
kinson’s for more than two decades. “It’s almost impossible for me to 
imagine how a serious scientist can not desire their data or hypothesis 
to be tested in double-blind studies.” 

Other scientists, however, say that sham brain surgery is an expensive, 
potentially dangerous and possibly unethical bit of biomedical theat- 
rics. It may also be unnecessary. Clinical neuroscientist Roger Barker at 
the University of Cambridge, UK, contends that because there is huge 
variation in how these therapies are administered and in how patients 
respond, the protocols need to be refined in an open-label setting before 
going on to the next stage of development. And because cost, complex- 
ity and the small number of people eligible for such invasive therapies 
limit the size of the studies, a sham control provides results of limited 
statistical utility. Barker and his colleagues across Europe are currently 
enrolling patients in a €12-million (US$17-million) multicentre trial of 
a fetal dopaminergic nerve-cell treatment for Parkinson's disease. The 
treatment may never be tested against a sham-surgery control. “There's 


growth factor, called neurturin, using a viral 
vector. For many sham treatments, everyone in the operating room, from 
surgeons to nurses assistants, must pretend that they are busily perform- 
ing the complete operation — in some cases, turning on machines to elicit 
appropriate noises. An extremely complex protocol ensures that no one 
outside the surgical team knows who got what treatment. “It’s very com- 
plicated, there are a lot of moving parts,’ Siffert says. All that ratchets up 
the cost of a trial; Siffert estimates that between operating-theatre costs, 
follow-up and the unwieldy infrastructure required for data management, 
a 50-patient study would cost more than $10 million. 

Still, at least in North America, Parkinson's disease investigators over- 
whelmingly support the use of sham surgery — ata rate of 94%, accord- 
ing to a 2004 survey’®. Around 20% said that penetrating the brain is 
justifiable. And proponents say the procedure is relatively safe. Although 
sham brain surgery has definite risks, most notably those associated 
with general anaesthesia, supporters note that adverse events are almost 
unheard of, unlike the risks of the actual treatments. And participants in 
the sham groups are generally promised the treatment ifit is ultimately 
approved; in that event, they will already have the 
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on sham surgery: Sham treatments help to tease out the placebo 
go.nature.com/8seayr effect and biases. In Parkinson's disease, the pla- 


cebo effect is especially strong. One reason is that 
patients’ expectations that they will benefit from 
a treatment induce the release of dopamine", the 
neurotransmitter that is lacking in the disease. 
“The placebo effect is real, it’s huge and it’s got a 
physiological basis,’ says Jon Stoessl, a neurologist 
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THE SHAM WALL 


In the past decade, sham-controlled studies have dashed hopes for all but a 
few surgical treatments for Parkinson’s disease. 


Therapy Open label Sham controlled 

(phase 1) (phase II) 
Transfer of gene | 12 patients; 19% 45 patients; 23% 
for glutamic acid | improvement at 3 months | improvement in treatment 
decarboxylase that persisted at 1 year!* group vs 13% in controls at 6 
(GAD) months!° 
Implantation 6 patients; 48% 71 patients; no significant 
of cultured improvement at 1 year, difference after 1 year? 
retinal cells 43% improvement at 


(Spheramine) 4 years! 


Transfer of gene 
for neurturin 


12 patients; 36% 
improvement at 1 year’ 


58 patients; no significant 
difference after 1 year*, but 
18-month follow-up and 
autopsy data spurred a 
second trial, now ongoing 


Infusion of glial- 
cell-line-derived 
neurotrophic 
factor (GDNF) 


5 patients; 39% 
improvement after 1 year® 


34 patients; no significant 
difference after 6 months? 


Improvements are for off-medication unified Parkinson's disease rating scale (UPDRS) 
motor score. 


Canada, who studies Parkinson's and the placebo effect. In one double- 
blind study of fetal nerve-cell transplants, patient improvement correlated 
with whether they believed they had received the treatment, irrespective 
of whether they actually ha 2 And the effect can last as long as two years, 
Stoessl says, citing an unpublished study by his colleagues. 

Many regard bias as a more significant confounder. “Investigators 
have a tremendous vested interest in seeing that their treatment is effec- 
tive,’ says Anthony Lang, a neurologist at the University of Toronto in 
Canada who has participated in several neurosurgical trials for experi- 
mental Parkinson's therapies. In any trial, bias can affect how research- 
ers assess patient responses and may inflate the patients’ expectations, 
further enhancing the placebo effect. Compounding the problem for 
Parkinsons’ research is the fact that there are no objective measures for 
how well a patient is doing. “It’s just a sort of perfect storm conspiring 
against our ability to see definitive changes in the underlying disease,” 
says Steven Piantadosi, a clinical-trials methodologist at Cedars-Sinai 
Medical Center in Los Angeles, California. “Sham surgery, properly 
done, can control for that.” 

Barker counters that it is possible to control for investigator bias in 
an open-label trial by taking steps such as having blinded raters assess 
patients. His position is in some ways unsurprising; in Europe, sham 
surgery is deemed much less acceptable than it is in the United States. 
It has never been used in the United Kingdom. Barker is categorical 
about his belief that transplantation of fetal tissue works, at least for 
some people. “I don’t need sham surgery to show that,’ he says, pointing 
instead to a paper’? published last year describing two patients treated 
13 and 16 years previously who were still benefiting from the treatment, 
and whose brains showed functional dopamine-producing neurons at 
the transplant site. He attributes the mixed results in past studies to 
variation in the patients selected for treatment, the characteristics of the 
tissue being implanted and the methods used to implant it. His trial will 
have to demonstrate efficacy without eliciting some of the side effects 
found in the two sham-controlled studies. That will require some type 
of control study, he says, but it might take the form of comparison to an 
approved therapy that is known to work, such as deep brain stimulation. 

But time, says Barker, will best establish efficacy. In most trials the end 
point is no more than a year after the treatment. That may not be long 
enough: implanted cells or injected growth factors might take longer 
than this to become fully functional, and the placebo effect may not 
have had time to dissipate. “We want a 3-5-year endpoint,’ says Barker. 
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There are hints from some of the failed phase II trials that patients 
followed up beyond study endpoints might tell a more positive story’. 
Some say, therefore, that sham controls are sinking the prospects of valu- 
able drugs. Anders Bjorklund, a neuroscientist at Lund University in 
Sweden who is collaborating with Barker, says that sham surgery can lead 
researchers to throw out a strategy prematurely if the trial fails because 
of technical or methodological glitches rather than a true lack of efficacy. 


ADVOCACY AND FRUSTRATION 

According to Perry Cohen, who leads a network of patient activists 
called the Parkinson Pipeline Project, that’s exactly what is happening. 
He had always questioned the need for sham surgery, he says, but after 
the string of phase II failures, “We started saying, ‘Hey, this is a problem. 
These trials failed, but we know they are working for some people.” 

For researchers, it is easy to dismiss patients’ concerns as being driven 
by emotion. “Patients want cures,” says Lang, “and they will often be 
convinced that the more aggressive, surgical therapies are more likely 
to be curative.” But Cohen counters that patients have different priori- 
ties and that researchers must take these into account. Researchers use 
placebo controls to weed out false positives. But for patients, the real 
ogre is the false negatives — which can sinka therapy before it has been 
optimized. The better a trial is at stamping out the former, the higher the 
rate of the latter — which means at best delays, and at worst dead ends. 
Spheramine, for example, “is still on a shelf somewhere’, Cohen says. 
Then there's Amgen's phase II trial of GDNF. The trial was halted in 2004 
amid lacklustre results and potential safety concerns, which some have 
attributed to Amgen’s procedure, rather than to the therapy itself. Now 
researchers are taking a renewed interest in the molecule, but although 
Cohen is glad it is getting a second chance, “we lost 6 years on it’, he says. 

Patients also have different perspectives on risk from researchers, 
Cohen says. He offers the story of Tom Intili, who had had Parkinson's 
for 10 years when, at the age of 50, he signed on to the double-blind, pla- 
cebo-controlled trial of neurturin. At first, Intili improved dramatically. 
But when the results were unblinded, he learned that he had received the 
sham. His condition plummeted, leaving him more debilitated than he 
had been before the trial. “We just don't know what the psychological 
effects of unblinding are,” Cohen says. 

Moreover, trying to exclude the placebo effect is simply misguided, 
Cohen argues. “I don’t want to subtract out the placebo effect — I 
want to keep it, because in real life it’s part of the treatment,’ he insists. 
Because psychological factors are so salient in Parkinson's, a placebo 
response might actually potentiate a therapy, he explains. “I want to be 
convinced that sham surgery is necessary. I'm looking for arguments 
that might change my mind, but I haven't found any yet,” he says. 

Willocks says that she is living proof that many of the recently shelved 
therapies are in fact salvageable. Of course from a scientific perspective, 
her story is an anecdote, not data. In May, the failed phase I study of 
Spheramine — the therapy she received a decade ago — was finally 
published’. The paper closes with a warning about the dangers of the 
placebo effect and stresses the importance of controlling for it with a 
double-blind design. “That last paragraph bothered me,’ Willocks says. 
“T just don’t see how they can call it a placebo effect after ten years.” m 


Alla Katsnelson is a freelance science writer in New York City. 
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The arXiv server in the early 1990s: a computer that helped to change the world of physics. 


ArXiv at 20 


Paul Ginsparg, founder of the preprint server, reflects on two decades of sharing 
results rapidly online — and on the future of scholarly communication. 


wenty years ago this month, I 
Tins an electronic bulletin board 
intended to serve a few hundred 
friends and colleagues working in a subfield 
of theoretical high-energy physics. I had 
recently moved to the Los Alamos National 
Laboratory in New Mexico and for the first 
time had my own computer on my desk, and 
the desire to simplify the exchange of unpub- 
lished manuscripts (preprints) between 
researchers, previously distributed as paper 
copies by post. 
This automated repository and alert sys- 
tem for physics preprints, at hep-th@xxx. 
lanl.goy, was implemented shortly before 


the dawn of the web era. As I e-mailed toa 
colleague at CERN more than a year later: 
‘I know nothing of WWW, what is it?’ The 
original plan was for roughly 100 full-text 
article submissions every year, each stored 
for three months until the existing paper dis- 
tribution system could catch up. By popular 
demand, nothing was ever deleted. 

Within a few years it had evolved into a 
web resource at arXiv.org that now contains 
close to 700,000 full texts, receives 75,000 
new texts each year, and serves roughly 1 mil- 
lion full-text downloads to about 400,000 
distinct users every week (see graphs). It has 
broadened, first to cover most active research 
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fields of physics, then to mathematics, 
nonlinear sciences, computer science, statis- 
tics and, more recently, to host parts of biology 
and finance infiltrated by physicists. 

It is heartening, 20 years later, to see a 
stable and successful arXiv, running some 
of the original software and providing ser- 
vices to a community nearly a thousand 
times larger than expected. But at some 
point a thorough overhaul will be needed 
to keep pace with new online trends and 
opportunities. 

For me, the repository was supposed to 
be a three-hour tour, not a life sentence. 
ArXiv was originally conceived to be fully > 
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> automated, so as not to scuttle my research 
career. But daily administrative activities 
associated with running it can consume 
hours of every weekday, year-round without 
holiday. So, from September, the site will be 
entirely in the hands of the staff of Cornell 
University Library in Ithaca, New York. I will 
remain on the advisory board and continue 
some research projects in text and data min- 
ing and in supporting next-generation docu- 
ment formats and information filters. 

To reflect on ArXiv is to reflect on a 
research world transformed by a two-decade 
revolution in information technology, with 
vast quantities of literature and associated 
resources now available on demand. Yet, 
it is a surprise that scholarly publishing as 
a whole remains in transition. There is no 
consensus on the best way to implement 
quality control (top-down or crowd-sourced, 
or at what stage), how to fund it or how to 
integrate data and other tools needed for 
scientific reproducibility. 

My hope is that rather than merely using 
electronic infrastructure as a more efficient 
means of distribution, the revolution-in- 
waiting will ultimately lead to a more power- 
ful knowledge structure, fundamentally 
transforming the ways in which we process 
and organize scientific data. 


DEMOCRATIC GOALS 

The original bulletin board was engineered 
to level the research playing field. It is hard 
to imagine now, but considerable time and 
effort was once spent printing, photocopy- 
ing and posting preprints to a privileged few 
friends and colleagues, before publication in 
formal journals. The idea of a central reposi- 
tory was to allow any researcher worldwide 
with network access to submit and read 
full-text articles, giving equal entry to every- 
one from graduate students up. (The early 
Internet was an academic playground — the 
general public didn't start coming online 
until a few years later.) 

Within two years, arXiv had evolved into 
the primary daily resource for a global com- 
munity of researchers. It became a place 
to stake intellectual precedence claims, 
catalysing further growth. 

Launched in 1991, before any con- 
ventional journals were online, arXiv 
pioneered many of the tools now taken 
for granted. We led the way in using the 
abstract page as a hub to diverse formats 
and resources, linking author names auto- 
matically to search functions, and we were 
early adopters of electronic formats from 
compressed postscript to PDF for file shar- 
ing. The repository showed that research- 
ers were willing and eager to move to fully 
electronic means of dissemination. It also 
foreshadowed the ‘interactive web, insofar 
as it provided a rudimentary framework for 
users to deposit content. The value of this 


146 | NATURE | VOL 476 | 11 AUGUST 2011 


DIGITAL PIONEERS LEAD THE WAY TO SHARING RESEARCH ONLINE 


The popularity of the arXiv preprint server has grown inexorably since its launch in the early 1990s. 
Academics enjoy the universal access, low cost and speed of online distribution. 
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content is then amplified by sharing. 

In many ways, building the technology 
was simpler than managing the sociological 
and financial aspects. A decade ago, the main 
site moved with me and became embedded 
within Cornell University library. Although 
it serves more users on a daily basis than any 
other library resource, most of those users 
are external, so it is less clear where it should 
fit in university funding priorities. In the 
absence of a wealthy donor willing to pro- 
vide a small endowment in exchange for far 
more name recognition than any traditional 
building donation (hint, hint), the library 
has recently asked institutions who are heavy 
users to contribute to running costs (http:// 
arxiv.org/help/support). This distributes the 
financial burden and oversight to a larger 
community, while buying time to investigate 
long-term business models. 

Physicists were quick to adopt wide- 
spread sharing of electronic preprints, but 
other researchers remain reluctant to do 
so. Fields vary widely in their attitudes to 
data and ideas before formal review, and in 
choosing to share electronic preprints, each 


community will have to develop policies 
and protocols best suited to their users. A 
talk I gave in 1997 to a group of biologists 
helped catalyse the resource now known as 
PubMedCentral — run by the US National 
Institutes of Health. I served on the initial 
advisory board, which soon decided not to 
host any unrefereed materials, even carefully 
quarantined, in part for fear of losing essen- 
tial publisher participation. There remain 
many legitimate reasons for individual 
researchers to prefer to delay dissemina- 
tion, from uncertainty over correctness, to 
retaining extra time for follow-ups, to socio- 
logical differences in the way publication is 
regarded — in certain fields, the research 
somehow doesnt count until peer reviewed. 

No community that has adopted arXiv 
usage has renounced it, however, so the 
growth has been inexorable. Adoption by 
some fields, including computer science, 
started as a trickle, increasing dramatically 
many years later. Even some subfields of 
physics have experienced delayed adoption: 
emerging research in 2008 into superconduc- 
tivity in iron compounds brought in a group 
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of condensed-matter experimentalists tradi- 
tionally more cautious about disseminating 
results. They were ultimately won over by the 
need to stake precedence claims and get their 
results in front of theorists. 

Even today, fields vary hugely in how 
they recognize intellectual precedence. It 
baffles me that scientists in some fields can 
announce a result in a public forum, such as 
a meeting, while another group can repro- 
duce the results, publish first in a journal, 
and be given complete intellectual prec- 
edence, as though the information did not 
exist until vetted by the referee process. Jour- 
nal editors and referees should make more 
effort to ensure proper attribution is given 
to publicly accessible materials in a stable 
resource, such as arXiv. 


WHERE NEXT? 

As the arXiv community has diversified, so 
have its desires. Some users have requested 
support for comment threads related directly 
to papers on the site, while others prefer 
that it maintain its unadulterated stream of 
author-provided content. I have sympathy 
for more interactivity: in today’s social web, 
a one-way channel seems an anachronism. 
But because maintaining utility and civility 
in online discussions can be labour-inten- 
sive, our policy has been that such services 
should be external to the main repository. 
The same considerations apply to self-organ- 
ized, or ‘crowd-sourced’ forms of review. In 
recent years, some external blogs have begun 
to host useful comment threads, linked back 
from arXiv abstract pages through a track- 
back mechanism, but I don’t predict that 
dedicated blogging by individual scientists 
will grow substantially. More scalable tools 
for discussion are provided by question- 
and-answer sites such as mathoverflow.net, 
where expert mathematicians, in the course 
of answering one another’s posted ques- 
tions, provide links to maths articles hosted 
on arXiv. 

Again, because of cost and labour over- 
heads, arXiv would not be able to imple- 
ment conventional peer review. Even the 
minimal filtering of incoming preprints 
to maintain basic quality control involves 
significant daily administrative activity. 
Incoming abstracts are given a cursory 
glance by volunteer external moderators 
for appropriateness to their subject areas; 
and various automated filters, including a 
text classifier, flag problem submissions. 
Although the overall rate of such submis- 
sions is well below 1%, they tend to cluster 
in specific areas (such as general relativity, 
quantum mechanics and unified theories in 

physics; proofs of the 
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in mathematics; P versus NP problem in 
computer science). 

Moderators, tasked with determining what 
is of potential interest to their communities, 
are sometimes forced to ascertain ‘what is 
science?’ At this point arXiv unintentionally 
becomes an accrediting agency for research- 
ers, much as the Science Citation Index 
became an accrediting agency for journals, 
by formulating criteria for their inclusion. 
Although decisions are biased towards per- 
missiveness, inevitably some authors object 
that it is never permissive enough. 


DIGITAL GENERATION 

The idea that print journals had outlived 
their usefulness was already in the air in 
the early 1990s. David Mermin memorably 
wrote in Physics Today in 1991: “The time is 
overdue to abolish journals and reorganize 
the way we do business.’ By the mid 1990s, it 
seemed unthinkable that free and unfettered 
access to non-refereed papers on arXiv would 
continue to coexist indefinitely with quality- 
controlled but subscription-based publica- 
tions. Fifteen years on, researchers continue 
to access both, successfully compartment- 
alizing their different roles in scholarly 
communication and reward structures. 

The transition to article formats and 
features better suited to modern technology 
than to print on paper has also been sur- 
prisingly slow. Page markup formats, such 
as PDE have only grudgingly given way to 
XML-based ones that support features such 
as manipulable graphics, dynamic views, 
linked annotations and semantic markup. 
Part of this caution is a result of the under- 
standable need to maintain a stable archive 

of research literature, as 
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ers requires getting into 
the heads of current 
undergraduates and 
graduate students. Their life experience is 
of immediate online availability and global 
search engines, and they arrive imbued with 
the social-network mentality of sharing links, 
photos, videos and status updates. Yet, my 
own informal survey of graduate students 
reveals information-gathering techniques 
familiar to most older scientists. Students 
still follow citation trees, search by keywords 
and consult with peers and mentors, with 
the latter as important as ever for weeding 
out unreliable sources. Students also say that 
they search preferentially for open-access 
resources when working from home, because 
accessing subscription-based journals, even 
when available through an institutional 
proxy, can be frustratingly painful. 


transition.” 
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Navigating increasing quantities of data 
inevitably raises concerns of information 
overload. This phenomenon is documented 
back to the dawn of writing, accelerated with 
the invention of the printing press, and has 
been re-emphasized by every generation 
since. The superficial response is a call for 
better filters, but an imperfect filter can be 
more harmful than none. For example, com- 
monly used recommender systems based on 
passive measures of global popularity can 
broaden individual reading choices, but 
effectively broaden everyone in the same 
direction, thereby leading to less overall com- 
munity diversity. (The cynic would say rein- 
forcing faddishness in already faddish fields.) 

On arXiv, we have seen some of the unin- 
tended effects of an entire global research 
community ingesting the same information 
from the same interface on a daily basis. The 
order in which new preprint submissions 
are displayed in the daily alert, if only for a 
single day, strongly affects the readership on 
that day and leaves a measurable trace in the 
citation record fully six years later’. Some 
researchers, wise to this, time their submis- 
sions to arrive just after the daily afternoon 
deadline to maximize their prominence 
in the next day’s mailing. Filters that high- 
lighted ‘popular’ materials over longer peri- 
ods of time would exacerbate this effect. 
Hence any recommender system on arXiv 
would need, at minimum, to be personal- 
ized to individual readership preferences 
and interests to reduce herding behaviour. 
Experiments with such systems are ongoing, 
and may be put online within a year or two if 
they perform properly. 

For now, the open questions of arXiv’s 
long-term role and its relationship to 
conventional publishing, the details of its 
funding model, and its overall intellectual 
supervision, are to be resolved in coor- 
dination with its users and stakeholders. 
A meeting of international sponsoring 
institutions will be hosted by the Cornell 
Library next month to discuss the transi- 
tion of arXiv to a collaboratively governed, 
community-supported resource. It will be a 
challenge to keep it attuned to the needs of 
future generations of researchers. 

My hope is that the barrier to implementa- 
tion of new ideas in this realm will remain 
low enough that, ifall else fails, some young 
researcher elsewhere can launch another 
tiny ship on a fateful trip. m 


Paul Ginsparg is at the Department of 
Physics, Cornell University, Ithaca, New 
York 14853, USA. 

e-mail: ginsparg@cornell.edu 
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The Lorax leaves in despair after failing to save his ecosystem from poisoning by the Thneed factory, run by the greedy, green-armed Once-ler. 


IN RETROSPECT 
The Lorax 


Emma Marris reflects on a classic children’s fable that 
still has lessons for environmental policy 40 years on. 


r | AVheodor Seuss Geisel, better known as 
Dr Seuss, wrote more than 40 chil- 
dren’s books, beloved for their zany 

rhymes and sinuous illustrations. In August 

1971 — the year after the creation of the 

US Environmental Protection Agency and 

celebration of the first Earth Day — Seuss 

published a book that became a kind of Silent 

Spring for the playground set. 

Thousands of children have learned 
about environmental destruction from the 
The Lorax, Seuss’s tale of ecological ruin 
brought on by greed. The book still reso- 
nates: Universal Studios is due to release a 
feature-length animation of it next year. It 
packs in a lot of sophisticated concepts for a 
picture book, from the interconnectedness 
of ecosystems to the effects of industrial pol- 
lutants on freshwater systems. There is even 
a trophic cascade — a shift in top predators 
that triggers changes through a food chain. 
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And what initially seems like a simplistic 
take on environmental policy — industry 
bad, activists good — turns out to be more 
subtle. The hero does not save the day; that 
task falls to the next generation. This down- 
beat, if realistic, plot arc makes me hesitant 
about introducing the book to my young 
daughter. 

An ecologist might classify the book’s lost 
paradise as a “Truffula savannah’ The key- 
stone species are the Truffula trees, which 
look like candy-coloured palms. In the story, 
every last one is chopped down by a face- 
less entity, the Once-ler, to provide the raw 
material for a multipurpose garment called 
a Thneed — anticipating Snuggies and 
Slankets by some four decades. 

Clearing the Truffula trees sets off a chain 
reaction that demonstrates the interdepend- 
ence of life. Without Truffula fruits, the ursine 
Brown Bar-ba-loots have nothing to eat. 
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This being a children’s 
book, they don’t go 
extinct. Rather, they 
are packed off, with 
much lamentation, to 
points unknown by 
the creature who gives 
the book its title: the 
Lorax, a diminutive, 
grandly moustached 
character who acts as 
advocate for the eco- 
system's species. 

The Lorax also 
complains about the 
unregulated Thneed 
factory, which belches 
out smog and dumps 
into a pond an astonishing quantity of indus- 
trial by-products known as Gluppity-Glupp 
and Schloppity-Schlopp. (According to the 
US National Resources Defense Council, 
textile factories pollute 200 tonnes of water 
per tonne of fabric produced.) The smog 
chases off the ecosystem’s avian endemics, 
the Swomee Swans, and the polluted water 
gums the gills of the Humming Fish, which 
must traipse away in search of cleaner ponds. 
Eventually, the bright Truffula savannah is 
replaced by a lifeless wasteland. 

Naturally, the Once-ler gets his 
comeuppance. By harvesting too many Truf- 
fula trees too quickly, he put himself out of 
business and retreats to his ruined factory 


The Lorax 

DR SEUSS 

First published by 
Random House: 1971. 
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to ruminate on the costs of not having a 
sustainable business plan. 

The Lorax leaves in despair, and the 
Once-ler hands over the task of restor- 
ing the Truffula ecosystem by giving the 
world’s last Truffula seed to a child. 

The Lorax himself is a parody of a mis- 
anthropic ecologist: “He was shortish. 
And oldish. And brownish. And mossy. 
And he spoke with a voice that was sharp- 
ish and bossy.’ He hectors and pleads, he 
“speaks for the trees’, but the Once-ler 
pays no attention until it is too late, say- 
ing, “All you do is yap-yap and say, ‘Bad! 
Bad! Bad! Bad!’” That approach seems a 
little dated in this era of ‘win-wir solu- 
tions. But Seuss understood, even back 
then, the limits of gloom and doom. The 
Lorax fails. Nevertheless, Seuss clearly 
had great affection for his impassioned 
little nag, as do the book's legions of fans. 

Perhaps the source of its enduring 
appeal has less to do with Seuss’s pres- 
cience about the futility of ecological 
doomsaying than with environmen- 
talists’ nostalgia for a time when such 
problems seemed more black and white. 
The Lorax portrays a world without the 
complexities of carbon trading, the pric- 
ing of ecosystem services, uncertainties 
over baseline states and the existence or 
not of pristine wilderness. For Seuss, wild 
nature is a paradise, industry is a malig- 
nant cancer and heroes take a stand. 

Then again, that old narrative has not 
been entirely displaced by the bureaucrats 
who set green targets at international 
meetings. Diverse natural land is threat- 
ened by industries; smog chokes skies; 
Gluppity-Glupp fouls waterways; people 
use natural resources like there is no 
tomorrow. We still need heroes to speak 
for the trees. 

Will I read my daughter The Lorax 
when she graduates from Pat the Bunny? 
Tm not sure. It captures the basics of why 
mismanagement and overexploitation of 
ecosystems are a bad idea in a way that 
children can understand. And because 
I believe that her generation’s challenge 
will be to manage the planet consciously, 
perhaps it is not too early to teach those 
lessons. But this is a gloomy book. The 
final image — of the Truffula seed 
hurtling into a tiny pair of hands — puts 
a lot of responsibility on small shoul- 
ders. Perhaps it is more important for 
her to learn the pleasures and beauty of 
nature first. Afternoons spent poking in 
the mud, catching cicadas and mapping 
out squirrel routes may be more likely to 
turn her green than the spectre of a world 
without Truffula trees. = 


Emma Marris writes for Nature from 
Columbia, Missouri. 
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How Earth shaped up 


Andrew Robinson enjoys an account of the first 
expedition to the equator to calibrate latitude. 


hat was the world’s first inter- 
national scientific expedition? 
Scientist-explorers of the 


Enlightenment, such as Alexander von 
Humboldt, would have answered without 
hesitation: the French government's eight- 
year geodesic mission to the Spanish colony 
of Peru, beginning in 1735. The mission — 
the subject of Measure of the Earth — cal- 
culated the precise length of a degree of 
latitude at the equator, thereby helping to 
define Earth’s shape. 

Today, the geodesic mission’s fame has 
faded. Until this book, no modern history 
of the mission has been published in Eng- 
lish (unpublished academic studies exist). It 
was virtually ignored in Ken Alder’s prize- 
winning account of the surveying of the arc 
of the meridian and the origins of the met- 
ric system, The Measure of All Things (Free 
Press, 2002). Neil Safier’s Measuring the New 
World (University of Chicago Press, 2008) 
focused instead on the Amazon explorations 
of its best-known member, Charles-Marie 
de La Condamine. But in 2006, the French 
geodesic mission was the basis for one epi- 
sode in a BBC television series, Voyages of 
Discovery, for which the historical consult- 
ant was science writer and naval historian 
Larrie Ferreiro. In 


Measure of the Earth, NATURE.COM 
Ferreiro puts this Formoreon 
important, dramatic measuring the 
and gruelling trip back _ meridian: 

on the map. go.nature.com/xrgrvs 


The geodesic mis- 
sion was organized by 
the French Academy 
of Sciences, under the 
sponsorship of the 
French minister of the 
navy acting in wary 
collaboration with his 
4 Spanish counterpart 
in Madrid. Spain sent 
two naval officers, also 
astronomers, to learn 


Measure of 


the Earth: The 
Enlightenment 


Expedition That from and keep an eye 
Reshaped Our on the French. 
World 


Along with a simi- 
lar French scientific 
mission to the Arctic 
Circle in 1736-37, 
the expedition deter- 
mined that a degree of latitude is shorter at 
the equator (where it measures a length of 
56,753 toises, or 110.54 kilometres) than at 
Paris (57,060 toises) or towards the North 
Pole (57,437 toises). Thus it proved that 
Earth was not a sphere but an oblate sphe- 
roid: it bulges slightly at the equator and is 
slightly flattened at the poles. 

This asymmetry explains why the force of 
gravity is slightly less near the equator than 
in Paris. This was first observed by a puzzled 
French astronomer measuring the period of 
a pendulum clock in Guyana in 1672, and 
seized on by Isaac Newton in his Principia 
Mathematica of 1687. Newton predicted 
the oblate shape of Earth from his theory of 
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The 1730s French geodesic mission revealed that Earth bulges at the equator 
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gravity. However, his idea was strongly 
contested in France by followers of 
philosopher René Descartes, whose 
theory of vortices in the ether had led 
them to the view that Earth must be 
prolate, like an upright egg. The geodesic 
mission thus settled a long scientific 
debate, and at the same time improved 
the accuracy of navigation by ships. 

By 1735, the basic geometric and 
astronomical principles of trigonomet- 
ric surveying were well understood. To 
the government and scientists in Paris, 
it seemed straightforward to apply 
these accurately among the peaks of the 
Andes, over a period of a year or two. 
But the expedition members had reck- 

oned without a 


“The expedition myriad of obsta- 
had reckoned cles, large and 
without small, includ- 
obstacles ing precarious 
including mountain trans- 
yellow fever, port, extreme 
local love weather, earth- 
affairs and quakes, altitude 


sickness, yellow 
fever, local love 
affairs and brigandry. They suffered the 
theft of their triangulation signals by 
local Indians, damage to instruments, 
the withdrawal of their French funding, 
Spanish colonial politics and the rip- 
ples from European wars, as well as the 
murder of their surgeon at a bullfight. 

There were also purely scientific chal- 
lenges, such as stellar aberration: the 
then little-understood phenomenon 
of tiny variations in the location of the 
stars used to fix latitude, which we now 
know arises as a result of Earth’s motion 
around the Sun. 

The most serious problem was the 
expedition’s lack of leadership. Interne- 
cine squabbling nearly scuppered the 
project. The precious results announced 
to the French Academy in 1744 by expe- 
dition member and geophysicist Pierre 
Bouguer were those measured by him- 
self and La Condamine, without the 
knowledge of the official expedition 
leader, Louis Godin, who decided to 
remain in South America as a profes- 
sor of mathematics in Lima. Measure of 
the Earth accomplishes its mission with 
skill and devotion, although it lacks some 
necessary diagrams. Its intermixing of 
politics and science is particularly fasci- 
nating. It also shows, unintentionally, the 
astonishing development of international 
scientific cooperation since the days of the 
Enlightenment pioneers. = 


brigandry.” 


Andrew Robinson is author of The 
Story of Measurement. 
e-mail: andrew.robinson33@virgin.net 
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Head to head 


Ascenario blaming rats for the devastation of Easter Island 
doesn’t account for recent results, argues Paul Bahn. 


aster Island, or Rapa Nui, isa perennial 
Hisense of scholars and the media 

because of its numerous giant stone 
figures and supposed mysteries. Most of these 
enigmas — including the origins of the stat- 
ues and the denuded landscape — have been 
solved in recent decades through painstaking 
work by archaeologists, anthropologists, envi- 
ronmentalists, linguists and geneticists. 

A consensus view has emerged, summa- 
rized in Easter Island, Earth Island (Thames 
and Hudson; 1992) by botanist John Flenley 
and myself, that the island was deforested 
by its inhabitants. They cleared the land 
for crops and used timber for the transport 
and erection of ever more statues, with war 
the end result. In The Statues That Walked, 
archaeologists Terry Hunt and Carl Lipo 
present a different picture, portraying the 
islanders as environmentally sensitive and 
peace-loving until Europeans arrived in the 
eighteenth century. But some recent publica- 
tions don’t support their hypothesis. 

Since the first known visit to the island by 
European vessels in 1722, people have won- 
dered how so many huge stone statues could 
be transported and raised, given the lack of 
available timber. Discoveries of root moulds, 
pollen grains and stumps showed that the 
island was originally covered in millions of 
large palms and other tree species. Analyses of 
pollen and plant macrofossils revealed dras- 
tic deforestation between the thirteenth and 


2011 
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Debate surrounds the transportation of Easter Island’s statues and the disappearance of its forests. 


seventeenth centuries, 
long before the Euro- 
peans arrived. 

A wide range of evi- 
dence, along with oral 
traditions, suggest that 
the Easter Islanders 
had lived cooperatively 


for centuries after their 
The Statues arrival from Polynesia, 
That Walked: probably in the early 
Unraveling the centuries AD. A thou- 
Mystery ofEaster sand years later, they 
Island were in conflict, living 
TEE LONE in a barren landscape. 
chair tea The ecological decline 
Free Press: 2011. 
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serves as a warning 
of what is happening to Earth as a whole, as 
argued by Flenley and myself, and by geogra- 
pher Jared Diamond in his best-seller Collapse 
(Viking, 2004). 

Inevitably, this scenario has elicited 
doubts. Archaeologists Catherine and 
Michel Orliac have speculated that the 
island’s deforestation was largely due to 
drought or climatic change, which may 
indeed have played a part. Others, including 
Hunt and Lipo, contend that the islanders’ 
ills were caused by the arrival of Europeans, 
rather than internal social pressures. 

In The Statues That Walked, Hunt and Lipo 
argue for alate date ofaround 1200 ap for the 
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islanders’ appearance on Rapa Nui. They 
claim that deforestation was mainly caused 
by the rats that came with them. Having 
found some palm nuts bearing gnaw- 
marks, they attribute the extinction of the 
island’s big palm to rat predation, although 
they say little about other tree species. The 
statues, they posit, were moved upright for 
many miles by swivelling, which required 
little timber. And despite the deforestation, 
they say, the islanders continued to grow 
sufficient food and remain free of quarrels 
until Europeans brought violence, germs 
and eventual devastation. 

The authors’ new scenario does sup- 
port the consensus view that the island 
was only colonized once, by Polynesians 
— and not, contrary to the theory of 
Norwegian adventurer Thor Heyerdahl, 
by Amerindians from the New World. 
The book contains good passages on the 
carving and transportation of the statues 
— which we know from oral testimony 
to be ancestor figures that were vener- 
ated by the islanders — including an 
account of the pioneering excavations of 
the statue roads by US geologist Charlie 
Love. The practice of lithic mulching, in 
which millions of stones are spread over 
the soil to retain moisture for crops, is 
described in detail. 

But coverage of work by others is 
incomplete. For instance, the authors 
mention only their own survey of the 
statues and not the decades-long (and 
ongoing) cataloguing by US archaeologist 
Jo Anne Van Tilburg. Nor do they note 
some recent published evidence that, in 
my view, refutes the book’s basic tenets. 

For example, a variety of evidence 
contradicts their claim of rat predation: 
numerous palm fruits not gnawed by 
rats, palm stumps burned and cut, con- 
tinued germination of palms despite the 
rats’ presence, and the disappearance 
of other plant species that coexist with 
rats elsewhere. Hunt and Lipo’s claim 
that human skeletal remains show lit- 
tle evidence of lethal trauma is refuted 
by quotes from anthropologist Douglas 
Owsley, the author of a 1994 paper that 
they reference. After examining more 
than 600 Easter Island skeletons, Owsley 
stated in a 2003 BBC documentary that 
the extreme frequency of injuries proved 
that these were people at war: “They’re 
slugging it out, there’s no doubt about it.” 

Hunt and Lipo present some of the 
island’s many features entertainingly, but 
the history of Rapa Nui is more complex 
than they allow. m 


Paul Bahn is the author (with John 
Flenley) of Easter Island, Earth Island. An 
updated third edition is out this month. 
e-mail: pgbahn@anlabyrd.karoo.co.uk 


The Brainbow technique colours individual neurons, here ina slice of mouse hippocampus. 


Picturing the soul 


Alison Abbott revels in a stellar mix of brain imageries. 


enerating beautiful images has never 
(S= the exclusive preserve of art: 

scientific representations of the brain 
have aesthetic value too, as portrayed in an 
exhibition at the German Hygiene Museum 
in Dresden, co-curated with the Moravian 
Gallery in Brno, Czech Republic. Images of 
the Mind presents more than 200 artistic and 
scientific works depicting the mind from 
medieval times to the present day. It also 
illustrates how the evolving imagery of artists 
has always been firmly rooted in contempo- 
rary scientific knowledge. 

Star names among the artists exhibited 
include Rembrandt, Leonardo da Vinci, 
Lucas Cranach the Elder and Albrecht 
Diirer, as well as luminaries Caspar David 
Friedrich and Edvard Munch, and contem- 
porary artists Bill Viola and Antony Gorm- 
ley. Unusually, works by relatively unknown 
artists from central Europe are also on show: 
Bohumil Kubista’s 1911 Epileptic Woman is 
a masterly portrayal of emotional torment. 

No less of a draw is the historical scientific 
imagery. Sketches by 1906 Nobel prize- 
winners Camillo Golgi and Santiago Ramon y 
Cajal are paired to illustrate a legendary 
scientific dispute. When they examined brain 
tissue under a microscope, each saw — and 
drew — a different structure. Golgi sketched 
a continuous web of cells; Cajal correctly 
depicted individual cells, or neurons. 

Also on view are newly discovered draw- 
ings by neuroanatomist Korbinian Brod- 
mann, who mapped the human cerebral 
cortex in 1908, and stunning images cre- 
ated using Jeff Lichtman’s and Joshua Sanes's 
Brainbow technique. Developed at Harvard 
University in 2007, the method uses differ- 
ent colours to pick out individual neurons 
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in a brain slice. 

The curators match 
the quality of the gath- 
ered objects with a 
fascinating narration 
of the history of mind imagery. The earliest 
exhibits are eleventh-century manuscripts 
depicting the Aristotelian understanding 
of cognitive processes. In the Renaissance, 
anatomists dissected corpses and drew, beau- 
tifully, what they saw, providing templates for 
artists to paint more realistically. Portraitists 
quickly attempted to go further, to capture 
the soul of their subjects as well as their exter- 
nal proportions. Rembrandt’ series of tiny, 
detailed self-portraits in different emotional 
states encapsulates this perfectly. 

Depictions of the mind changed again 
in the twentieth century, when Sigmund 
Freud divided the psyche into the conscious 
and unconscious. Artists such as Kubista 
began to explore the fractured mind more 
abstractly — by then, photography was in 
any case capturing the realism to which they 
had formerly striven. 

During the past few decades, scientists 
have described the brain in ever greater 
electrical, molecular and anatomical detail. 
Artists have responded, often question- 
ing whether the wet, electrical mass of the 
physical brain could alone host the mind. 
Perhaps they are anticipating new neurosci- 
entific understanding of the powerful ways 
in which environment shapes the brain. For 
both artists and scientists, the exhibition is a 
reminder that the mind remains a mysterious 
moving target. m 


Images of 

the Mind 
German Hygiene 
Museum, Dresden. 
Until 30 October. 


Alison Abbott is Nature’ Senior European 
correspondent. 
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ORRESPONDENCE 


On the care and use 
of US lab animals 


Some recommendations in the 
latest Guide for the Care and 
Use of Laboratory Animals (see 
go.nature.com/i9y7fr) have 
stirred up controversy in the 
research community. As former 
members of the committee that 
updated the guide, we would 
like to comment informally 

on these objections (see 
disclaimer). 

The guide was released 
last year by the US National 
Research Council. 
Recommendations may apply 
to institutions funded by the 
US Public Health Service, but 
will, however, be used by the 
Association for the Assessment 
and Accreditation of Laboratory 
Animal Care International, a 
non-profit organization based 
in Frederick, Maryland, to 
evaluate and accredit research 
institutions. 

Because of its many users 
and settings, the guide is 
written in broad statements 
of desired outcomes 
(performance standards) and 
is largely devoid of detailed 
specifications. This enables 
research institutions to create 
customized procedures and 
programmes within the context 
of the recommendations 
for high-quality care, from 
external acquisition or in-house 
breeding of lab animals to their 
final disposition. 

The latest edition of 
the guide expands on this 
approach, partly on the basis 
of overwhelming support for 
performance standards by the 
research community. Despite 
this, some are urging the 
National Institutes of Health’s 
Office of Laboratory Animal 
Welfare (OLAW) not to adopt 
the eighth edition, spurred 
by fears over how OLAW will 
interpret and implement the 
guide's recommendations. 
OLAW is currently considering 


implementation of the guide 
and is due to issue a position 
statement soon. 

Objections mainly 
focus on the guide's space 
recommendations, particularly 
for breeding rodents. Optimal 
space requirements depend on 
factors including strain, sex, 
age, enrichment and animal 
psychology. The guide points 
out the necessity for study 
and assessment of the space 
requirements of laboratory 
species, while leaving previous 
recommendations essentially 
unaltered. Emphasis is placed 
instead on performance 
standards that are critical for 
deciding on minimal cage 
space. 

Lobby groups representing 
commercial rodent vendors 
(among others) estimate 
that compliance with this 
new standard will cost 
upwards of US$500 million. 
But many institutions 
have already implemented 
performance standards to 
prevent overcrowding or 
are voluntarily using larger 
breeding cages. Standards 
for rodent husbandry should 
not be bound by commercial 
profitability. 

Further research is needed 
into space and housing 
requirements and the provision 
of enrichment, exercise and 
human contact. We believe that 
the latest recommendations 
and a performance-standards 
approach will eventually help 
to define the highest welfare 
standards for research animals. 
Janet C. Garber on behalf 
of 13 co-signatories*, Garber 
Consulting, Pinehurst, North 
Carolina, USA. 
jgarber@nc.rr.com 
*See http//dx.doi.org/476152a for 
a full list of signatories. 


Disclaimer: Opinions expressed in this 
Correspondence are strictly those of 
the authors, written in their personal 
capacity. They do not represent the 
views of colleagues, current or past 
employers, or any other body. 
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Rule on papers puts 
China’s PhDs at risk 


To help counter the 
overemphasis on quantity 
rather than quality of research 
publications in China (Nature 
475, 267; 2011), many Chinese 
universities require their PhD 
candidates to publish a specified 
number of papers in high-impact 
journals before they are awarded 
their degree. 

Although laudable in 
principle, this requirement 
puts even more pressure on 
students. If that is compounded 
by inadequate guidance and 
support from mentors, it will 
encourage more scientific fraud 
and hamper the development of 
junior scientists in the long run. 
Derrick Y. F. Lai McGill 
University, Montreal, Canada. 
yukfo.lai@mail.mcegill.ca 


Helping hand for 
genomics in Africa 


We offer a personal example of 
the benefits of lending African 
scientists ‘a helping hand’ (Nature 
474, 542; 2011). After a five- 

year relationship, a genomics 
network partnership was last 
year officially set up between 

the J. Craig Venter Institute in 
San Diego, California, and the 
University of Limpopo, one of 
the least resourced universities in 
South Africa. 

The University of Limpopo, 
with campuses in Polokwane 
and Pretoria, is the result ofa 
merger between South Africa’s 
University of the North and the 
Medical University of Southern 
Africa — previously ‘non-white 
higher-education facilities as 
defined by the apartheid policies. 

The university is striving 
to build regional resources 
anda skill base in genomics to 
help find scientific solutions 
for rural Africa. International 
partnerships, such as the one 
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between our institutions, gives 
the University of Limpopo’s 
researchers access to current 
technologies, informatic 
expertise and training that is 
unavailable locally. 

Together we are seeking 
practical solutions through 
educational workshops, 
student and staff exchanges, 
improvements in capacity 
and infrastructure, and equal- 
contribution projects, while 
tackling the inevitable challenges 
as a single unit. 

Our experience suggests that 
‘joining hands’ could ultimately 
prove to bea successful strategy. 
Vanessa M. Hayes J. Craig Venter 
Institute, San Diego, California, 
USA. vhayes@jcvi.org 
Philip A. Venter University of 
Limpopo, Polokwane, South Africa. 
M. Jeffrey Mphahlele University 
of Limpopo, Pretoria, South Africa. 


Mathematics walks 
into history 


Mathematician William 

Rowan Hamilton did indeed 

hit on the idea of quaternions 
—acomplex-number system 
extending into four dimensions 
— while crossing a Dublin bridge 
in 1843 (Nature 475, 167; 2011). 
Yet it was what the Irishman 
was doing while absorbed in 
mathematics that gives him 

the last laugh, for quaternions 
are used in the clinical and 
laboratory analysis of bodily 
motion, notably of gait. So what 
was Hamilton doing? Walking 
into history. 

John Hart Oxford Brookes 
University, Oxford, UK. 
johnhart1@btconnect.com 
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Hydrogen for dinner 


The vast array of bacterium-animal symbioses at deep-sea hydrothermal vents was thought to be fuelled by just two 
chemicals. A study of one such symbiosis in its environmental context reveals a third energy source. SEE ARTICLE P. 176 


VICTORIA J. ORPHAN & TORI M. HOEHLER 


( "recente efforts have been made in 
recent years to develop hydrogen as an 
energy source, with one strategy being 

to harness the activity of hydrogen-metaboliz- 

ing microorganisms. On page 176 of this issue, 

Petersen et al.' report that humans are not the 

first animals to use such a strategy: at sea- 

floor hydrothermal vents, mussels bathed in 
hydrogen-rich fluids thrive by supporting and 
harvesting an internal population of hydrogen- 
consuming bacteria. Hydrogen is therefore 
the first chemical energy source for microbe- 
animal symbiosis to have been discovered in 

25 years, and joins hydrogen sulphide and 

methane on the short menu of compounds 

used to fuel organisms at hydrothermal vents. 

Billions of years of photosynthetic activity 
have left Earth with a thin veneer of highly 
oxidized material over a mass of chemically 
reduced rock — which means that the planet 
is, essentially, a battery. Hydrothermal circula- 
tion serves to tap this battery: fluids circulating 
through the sea-floor crust are charged with 
reduced (electron-laden) compounds and 
emerge as jets into the oxidizing environment 
of the ocean. The transport of compounds 
from the main reduced rock mass (the anode) 
to the ocean (the cathode) constitutes a flow 
of electrons. And, as with any battery, this 
electron flow is a source of usable energy. 

The energy delivered by vent-fluid chemis- 
try fuels thriving communities of microorgan- 
isms and animals — densely populated oases of 
life on an otherwise sparsely inhabited ocean 
floor (Fig. 1). But some of the vents’ most con- 
spicuous and abundant animal inhabitants 
have neither mouths with which to feed, nor 
the capability to directly metabolize venting 
chemicals. Instead, these organisms rely on 
remarkable symbioses with microorganisms 
that do have such a capability. The charac- 
terization of these unusual bacterium—animal 
partnerships was initiated 30 years ago, with 
reports” of a symbiosis between the giant 
tubeworm Riftia pachyptila and bacteria that 
oxidize hydrogen sulphide gas (H,S). Numer- 
ous other examples of symbiotic vent-dwelling 
animals have since been described. All are 
based on the metabolism of either methane or 
hydrogen sulphide’. 


Figure 1 | In hot water. These mussels and associated fauna are bathed in hydrothermal fluids at the 
Wideawake vent field on the Mid-Atlantic Ridge. (Photo taken by the remotely operated, deep-sea 
submersible MARUM-QUEST.) 


The seemingly restricted diet of these bac- 
terium-animal symbioses is in many ways 
surprising. Hydrogen, ammonium, metals and 
a variety of other reduced compounds can be 
abundant in vent fluids along with methane 
and hydrogen sulphide, and free-living micro- 
bial consumers of most of these compounds 
have been identified at vents’. Hydrogen, in 
particular, is a near-ubiquitous product of the 
reaction of water with oceanic crustal rocks, 
and is therefore a quantitatively significant 
component of most vent fluids. In vent sys- 
tems hosted by peridotite (mantle-like) rock, 
for example, hydrogen is present in greater 
abundance, and stores more energy that could 
potentially be used for microbial metabolism, 
than either methane or hydrogen sulphide’®, 

The apparent constraints on the chemical 
basis of bacterium-animal mutualisms could 
be due to physiological limitations of the host 
animals’. But they could also reflect the fact 
that our understanding of vent systems is still 
limited 34 years after their discovery. Petersen 
et al.' now reveal that these symbioses are 
indeed more metabolically versatile than was 
previously thought. 

The study of hydrogen use and other 
biological and geochemical processes in 
deep-sea hydrothermal vent systems poses 
unique challenges because of the remote 
location of the vents — which, on aver- 
age, are found at depths greater than 2 kilo- 
metres — and the heterogeneous and highly 
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variable physical and chemical conditions that 
frequently characterize them. Accessing these 
environments requires robotic or manned 
submersibles, but the resulting animal collec- 
tions, ecological observations and shipboard 
geochemical analyses often offer only limited 
‘snapshots’ of these dynamic ecosystems. 
Rather than rely on such discrete shipboard 
measurements, Petersen et al. deployed a mass 
spectrometer capable of being used in situ in 
the deep sea to map the distribution of chemi- 
cals within vent systems directly. This provided 
tantalizing evidence that both hydrogen and 
methane were being consumed within the 
densely colonized mussel beds. 

Although the authors’ in situ geochemical 
data were suggestive of active hydrogen metab- 
olism within the mussel beds, further evidence 
was required to directly link this metabolism 
to the two known microbial symbionts living 
in the molluscs’ gills. One of these microbes 
is known to metabolize hydrogen sulphide, 
whereas the other metabolizes methane 
(Fig. 2). Petersen and colleagues therefore 
incubated symbiont-laden gill tissue in sea 
water infused with hydrogen, and measured 
the rate at which the microbes fixed carbon 
dioxide. They found that the rate was similar to 
the fixation rate of tissues bathed in hydrogen 
sulphide, indicating that the sulphide-oxidiz- 
ing and/or methane-oxidizing symbionts were 
also responsible for the observed hydrogen 
consumption in the mussel beds. 


MARUM, UNIV. BREMEN 


The authors went on to perform a meta- 
genomic analysis of the symbionts in the gill 
tissue, and found that a key gene for hydrogen 
oxidation (hupL, which encodes part of the 
hydrogen-oxidizing enzyme [NiFe]-hydro- 
genase) is located on a DNA fragment that 
also contains sulphide-oxidation genes. This 
strongly suggests that the sulphide-metabo- 
lizing symbionts are responsible for hydrogen 
consumption. To get specific confirmation of 
this, Petersen et al. used fluorescent probes 
that target the hupL gene and the hydrogenase 
protein in the symbionts. They observed that 
sulphide-oxidizing symbionts in the gill were 
lit up by the hupL probe, whereas neighbour- 
ing methane-oxidizing bacteria were not. 
Collectively, the authors’ observations present 
a compelling case that the long-known bac- 
terium—mussel symbiosis can operate using a 
previously unknown metabolism. 

It is likely that other symbioses will be dis- 
covered that also use hydrogen as a fuel source, 
given that symbioses involving relatives of 
the mussels’ sulphide-oxidizing symbionts 
are widespread in reducing environments. 
Indeed, the authors find’ that symbionts of 
the vent-dwelling shrimp Rimicaris exoculata 
have hupL, providing genetic evidence that 
crustaceans might also tap into hydrogen as an 
energy source through symbiosis. Follow-up 
studies should now determine the full extent of 
hydrogen use by symbionts, as well as the envi- 
ronmental and biological factors that influence 


Methane-oxidizing 
bacteria 


Figure 2 | Hydrogen-consuming symbiosis. 
Bathymodiolus puteoserpentis mussels live at 
hydrothermal deep-sea vents, which emit hydrogen, 
methane (CH,) and hydrogen sulphide (H,S), among 
other chemicals. The mussels draw water through 
their siphons and into enlarged respiratory gills that 
house symbiotic bacteria. Some of these bacteria 
harness energy by oxidizing methane to carbon 
dioxide (blue arrow), whereas others do this by 
oxidizing hydrogen sulphide to sulphate (SO, light 
green arrow). Petersen et al.’ report that the sulphide- 
oxidizing bacteria also use hydrogen as an energy 
source, oxidizing it to water (dark green arrow). 


when and where hydrogen utilization occurs 
in chemosynthetic ecosystems. 

Petersen and colleagues’ work exemplifies 
the technology-driven revolution that is occur- 
ring in the biological sciences. The continuous 
development of ever more powerful and spe- 
cific molecular tools allows taxonomic iden- 
tity and gene content to be linked to metabolic 
potential and activity, and to be visualized in 
context. As these techniques converge with 
new instrumentation that allows the in situ 
characterization of physicochemical param- 
eters — even in environments as remote and 
extreme as hydrothermal vents — biologists 
are freed from their reliance on model organ- 
isms in artificial surroundings. Now, more 
than ever, our understanding of biology can be 
placed in the correct environmental and eco- 
logical context, enabling the discovery of pre- 
viously unknown activities that support life. m 
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Microwave ion-trap 
quantum computing 


Anew type of ion-trap quantum technology has been developed that uses microwave 
radiation to perform computations. It will considerably simplify the practical 
implementation of large-scale quantum computers. SEE LETTERS P.181 & P:185 


WINFRIED K. HENSINGER 


he strange phenomena of quantum 

physics, such as the possibility of a sin- 

gle atom being in two different places 
simultaneously, have mystified experienced 
physicists and students alike. These phenom- 
ena are not just theoretical curiosities: a new, 
practical, field of quantum physics has become 
particularly vibrant during the past ten years. 
It concerns harnessing the power of quantum 
effects to produce an innovative type of tech- 
nology — quantum technology. This has the 
potential to revolutionize computing. Although 
ways to implement this technology are being 
pursued in various physical systems, quantum 
computing using trapped ions’ * has undoubt- 
edly been the most successful so far. In this 
issue, Ospelkaus et al.* (page 181) and Timoney 
et al.” (page 185) describe a trapped-ion 
approach that further bolsters the enormous 
potential of this system for the implementation 
of large-scale quantum computers. 

Ospelkaus et al.* report the realization of 
quantum gates using microwave radiation 
instead of laser beams, which have until now 
been used to implement such gates. Quan- 
tum gates are the analogues of classical logic 
gates, and are used to perform computa- 
tions ina quantum processor. Timoney et al.” 


discuss a new approach to quantum comput- 
ing — involving microwaves — that is highly 
resilient to noise (often referred to as deco- 
herence) acting on the physical system used 
to execute the computations. Decoherence 
has the potential to destroy quantum effects 
and interfere with the operation of a quantum 
processor. Timoney and colleagues’ scheme 
effectively shields the quantum processor from 
decoherence. 

Quantum gates executed in a quantum pro- 
cessor are based on the creation of entangle- 
ment. Entanglement is one of the non-intuitive 
phenomena of quantum physics, whereby 
the properties of multiple systems, such as 
groups of ions, are correlated. Ordinarily, the 
outcome of measuring a particular property 
on an ion yields a completely random result. 
But measuring the same property on each 
ion of a set of entangled ions produces cor- 
related results. Non-entangled ions do not 
produce such correlations. An easy way to 
understand the situation is to imagine two 
people each tossing a coin. The outcome 
of each coin toss is random. However, if the 
coins were entangled in a particular way, the 
outcome of the coin toss would always be 
the same for both coins — with both people 
getting heads or tails. 

The past few years have witnessed ground- 
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The authors went on to perform a meta- 
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their siphons and into enlarged respiratory gills that 
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dioxide (blue arrow), whereas others do this by 
oxidizing hydrogen sulphide to sulphate (SO, light 
green arrow). Petersen et al.’ report that the sulphide- 
oxidizing bacteria also use hydrogen as an energy 
source, oxidizing it to water (dark green arrow). 


when and where hydrogen utilization occurs 
in chemosynthetic ecosystems. 

Petersen and colleagues’ work exemplifies 
the technology-driven revolution that is occur- 
ring in the biological sciences. The continuous 
development of ever more powerful and spe- 
cific molecular tools allows taxonomic iden- 
tity and gene content to be linked to metabolic 
potential and activity, and to be visualized in 
context. As these techniques converge with 
new instrumentation that allows the in situ 
characterization of physicochemical param- 
eters — even in environments as remote and 
extreme as hydrothermal vents — biologists 
are freed from their reliance on model organ- 
isms in artificial surroundings. Now, more 
than ever, our understanding of biology can be 
placed in the correct environmental and eco- 
logical context, enabling the discovery of pre- 
viously unknown activities that support life. m 
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related results. Non-entangled ions do not 
produce such correlations. An easy way to 
understand the situation is to imagine two 
people each tossing a coin. The outcome 
of each coin toss is random. However, if the 
coins were entangled in a particular way, the 
outcome of the coin toss would always be 
the same for both coins — with both people 
getting heads or tails. 

The past few years have witnessed ground- 
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50 Years Ago 


The Prehistoric Chamber Tombs of 
France. By Glyn Daniel — To win 
appreciation for some phase of 
antiquity on the strength of modern 
excavations is far easier than what 
has been undertaken here — where 
the antiquities concerned have been 
known for centuries, until only 
yesterday often excavated badly, and 
celebrated in a literature in which 
perverse and obsolete terminologies 
have run riot. Dr. Daniel, supported 
by his wife, and with backing from 
many quarters as well as friendly 
French co-operation, has for long 
been working towards a systematic 
account of the megalithic and related 
stone tombs of France, their form 
and contents, and their placing in 
the frame of European prehistory in 
the third and second millennia B.c., 
to which in general they belong. This 
book, by no means his first study of 
megalithic structures, is his best so 
far. It is not too hard to read. 

From Nature 12 August 1961 


100 Years Ago 


In the July number of The American 
Naturalist Dr. O. P. Hay reopens 

the discussion with regard to the 
position of the limbs in Diplodocus 
and other sauropod dinosaurs, 
criticising the views of those who 
assert that these reptiles carried 
themselves in elephantine fashion, 
and maintaining his own opinion 
that the general pose was more after 
the crocodilian style... Mr. Hay 
expresses doubts as to whether the 
erect bird-like posture attributed 

to the carnivorous dinosaurs of the 
Jurassic is really true to nature ... 

In reference to the opinion of 

Dr. Matthew that Sauropods were 
too bulky to have lived on land, it is 
added that “the law to which he gives 
expression does, of course, prescribe 
a limit to the size an animal can 
attain, but who has yet determined 
what that limit is?” 

From Nature 10 August 1911 


Entangled ions 


Microwave 
waveguide 


Figure 1 | Microwave ion-trap chip. Ospelkaus and colleagues’ approach’ to entangling pairs of ions and 
generating a quantum gate involves launching microwaves into a waveguide incorporated on a chip. 


breaking advances in quantum-information 
processing using trapped ions, including the 
entanglement of up to 14 ions’, and the devel- 
opment of other entanglement-based pro- 
tocols, such as the realization of a number of 
quantum algorithms’ and of teleportation”®. 
These advances have been made in experi- 
ments in which laser beams are used to perform 
entanglemewnt operations. In 2001, Mintert 
and Wunderlich had the visionary idea’ of 
implementing quantum gates using long-wave- 
length radiation, such as microwaves or radio 
waves. Whereas laser beams must be carefully 
aligned to interact with the trapped ions that 
are to be entangled, microwaves can be applied 
via waveguides (structures that can guide 
radiation) that are part of the chip on which the 
ion trap is integrated”, and so do not require 
alignment. 

What's more, it is much easier and less costly 
to generate microwave radiation than it is to 
use the complicated laser systems currently 
employed, and highly stable microwave sources 
are readily available. Large-scale quantum com- 
puters may require many millions of individu- 
ally trapped ions, each constituting a single 
quantum bit (the basic unit of information 
storage in a quantum computer). As a result, 
creating the required number of laser beams 
to entangle the ions may entail significant engi- 
neering and come at a considerable cost. By 
contrast, the use of microwave radiation for the 
same purpose would be much easier and would 
make the construction of a large-scale ion-trap 
quantum-in formation processor much simpler. 

Based on their proposal" to make use of the 
oscillating magnetic fields that are inherent to 
microwave radiation (the original proposal by 
Mintert and Wunderlich’ requires static mag- 
netic fields in addition to microwaves), Ospel- 
kaus et al.* have realized the first microwave 
quantum gate. They achieved this by using a 
waveguide integrated into a microchip (Fig. 1) 
that holds the ion-trap structure. The micro- 
chip contains electrodes that produce electric 
fields capable of trapping two ions just above 
the chip’s surface. Multiple pulses of micro- 
wave radiation are then applied to the trapped 
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ions through the waveguide, effectively entan- 
gling the two ions and successfully executing 
a quantum gate. 

Meanwhile, Timoney et al.” trapped 
individual ions and applied a number of 
microwave pulses to them. This approach sets 
the ions to a state in which they are decoupled 
from outside noise. An easy way to visual- 
ize this is by considering the suspension of 
a common car. Springs in the car’s suspen- 
sion system decouple the car frame from the 
wheels, largely isolating the driver from vibra- 
tions caused by uneven road surfaces. In a 
similar way, Timoney and colleagues’ scheme 
allows trapped ions to be isolated from exter- 
nal disturbances that would otherwise have 
the potential to disturb the operation of a 
microwave trapped-ion quantum processor. 

The achievements of Ospelkaus et al.* and 
Timoney et al.” constitute step-changing inno- 
vations for quantum computing with trapped 
ions because they will probably aid the pro- 
duction of large-scale ion-trap quantum com- 
puters on foreseeable timescales. Quantum 
computing is likely to revolutionize many 
areas of science, and we have only just started 
to appreciate its true potential. m 
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oscillating magnetic fields that are inherent to 
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that holds the ion-trap structure. The micro- 
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Enigma of the recent 
methane budget 


The previously increasing atmospheric methane concentration has inexplicably 
stalled over the past three decades. This may be due to a fall in fossil-fuel emissions or 
to farming practices that are curtailing microbial sources. SEE LETTERS P.194 & P.198 


MARTIN HEIMANN 


here has been a recent unexpected 

variation in the atmospheric concen- 

tration of methane (CH,) — the second 
most important anthropogenically modified 
greenhouse gas after carbon dioxide. Its growth 
rate has declined almost to a standstill in the 
past three decades, despite more than doubling 
since pre-industrial times; signs of renewed 
growth are only now starting to appear. What 
has caused the puzzling behaviour of this 
atmospheric trace gas? In potentially contradic- 
tory studies on pages 194 and 198 of this issue, 
Kai et al.'and Aydin et al.’ attempt to answer 
this question. 

The growth rate of the atmospheric meth- 
ane concentration effectively reflects the global 
methane budget (Fig. 1), which at present deli- 
cately balances large sinks and sources. Global 
sinks arise primarily from the activity of the 
hydroxyl radical (OH"), which is involved 
in photochemical oxidation reactions in the 
atmosphere. Although sinks are quite well 
understood, the relative contributions of the 
various processes that produce methane are 
uncertain. 

About 60% of global methane stems from 
human activities. This includes emissions 
associated with energy production and usage 
— such as coalmining, incomplete combustion 
of fossil fuels and gas leaks. These fossil-fuel- 
derived emissions, together with methane 
from landfills, organic waste, cattle raising, rice 
agriculture and biomass burning, were prob- 
ably the main cause of the increasing concen- 
trations after pre-industrial times. Methane 
also forms naturally, mostly from anaerobic 
decomposition of organic carbon in wetlands, 
with lesser contributions from the ocean, 
termites, wild animals, wildfires and various 
geological sources. 

How can the relative contributions of the 
various sources to the global methane budget 
be determined? One technique depends on 
the fact that most of the source processes have 
a distinct geographical signature, which is 
reflected in the atmospheric methane-concen- 
tration distribution. For example, fossil-fuel 
methane emissions that predominantly occur 
in the Northern Hemisphere cause a north- 
south concentration gradient that is visible in 


atmospheric measurements, including those 
made at surface stations’ and from space-based 
observations’. This method identified a fall in 
fossil-fuel-derived methane emissions in the 
1990s, reinforced by a coincident decrease 
in wetland emissions since 2000, as the main 
drivers of the global decline in methane growth 
rate over the past 30 years” , albeit with substan- 
tial uncertainties. Other methane sources also 
have distinct geographical patterns, but many 
of these overlap, and so the attribution and 
quantification of the different sources from 
atmospheric measurements are not unique. 
Aydin and colleagues’ offer new information 
on the history of methane emissions from fos- 
sil fuels in the Northern and Southern Hemi- 
spheres over the past 60 years. They achieve 
their reconstruction by measuring the ethane 
(C,H,) in air trapped in porous snow-ice (firn) 
in Greenland and Antarctica. Like methane, 
ethane is released during the mining, transport 
and incomplete burning of fossil fuels, but also 
from incomplete biomass burning. With the 
help of a simple atmospheric mixing model, 
the authors resolve the ethane-concentration 
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record into changes in ethane-source strength 
and, given typical hemispheric patterns of 
fossil-fuel and biomass-burning sources, 
into changes over time of these two sources. 
They then assume fixed methane-to-ethane 
emission ratios in estimating the history of 
fossil-fuel methane emissions. 

Their results are surprising. Whereas the 
emissions deduced for biomass burning are 
consistent with independent bottom-up 
estimates, the inferred history of fossil-fuel- 
derived methane emissions before 1980 is 
strikingly different — double the estimates 
from standard databases based on the statistics 
of fossil-fuel production. During 1980-2000, 
the record of fossil-fuel methane emissions 
shows an almost 30% decline, which would go 
a long way towards explaining the observed 
decrease in the global methane growth rate. 

Kai and colleagues’ offer an alternative 
explanation. They make use of the fact that 
methane from fossil fuels is enriched in its 
stable °C/”C carbon-isotope ratio, whereas 
methane from microbial sources (mainly wet- 
lands and rice paddies) is depleted in *C/”C 
with respect to the atmospheric background. 
Furthermore, variations in the 7H/'H hydro- 
gen-isotope ratio in methane are primarily 
affected by changes in the photochemical sink. 
Observations of the atmospheric methane 
concentration, in conjunction with measure- 
ments of its stable-isotope ratios, thus provide 
an alternative means to distinguish between 
different source categories. In combination 
with a simple model of atmospheric mixing, 
the authors compare the simulated atmos- 
pheric signatures of various methane-source 
scenarios with observations from the past two 
decades in each hemisphere. 
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Figure 1 | Changes in global atmospheric methane over the past three decades. a, Globally averaged 
methane concentration, compiled from a global network of atmospheric measurement stations. 

b, Globally averaged growth rate of methane in the atmosphere. Dashed lines indicate + 1 standard 
deviation; p.p.b., parts per billion. (Data and graphic are from E. Dlugokencky*, NOAA-GMD.) 
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Surprisingly, Kai et al. find that a reduction 
in the fossil-fuel methane source is not com- 
patible with these measurements, and that 
the isotope record can be explained only by 
a reduction in the microbial sources in the 
Northern Hemisphere. A drying trend in 
northern wetlands could have contributed 
to this finding’, but the authors convinc- 
ingly show that methane emissions from rice 
agriculture, particularly in China, must also 
have decreased. Their conclusion is based on 
changes in agricultural practices: new high- 
yield rice species, together with greater appli- 
cation of fertilizer, require shorter inundation 
periods, making substantial water savings and 
reducing methane emissions. 

Can these conflicting inferences on the 
recent slow-down of global methane growth be 
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reconciled? Because of the limited data coverage 
and simplistic analysis assumptions of the two 
studies'”, there are considerable uncertainties 
in the deduced methane-source variations, but 
the different scenarios are plausible and com- 
patible with their respective observations. The 
challenge now is to bring the different lines of 
evidence together, perhaps by using a more 
advanced modelling framework and improved 
bottom-up inventory information on the vari- 
ous methane-source categories. More extended 
observations will help too — particularly the 
mapping of atmospheric methane concentra- 
tion by current and upcoming satellite missions. 

These studies’” illustrate the importance 
of high-precision, long-term observations 
of methane concentration and isotope com- 
position, and of auxiliary trace gases such 


Bespoke cells for 
the human brain 


Human skin cells have been directly converted into neurons, an achievement 
that could lead to the cell-based treatment of neurodegenerative disorders. But 
the road ahead remains long and tortuous. SEE LETTERS P.220, P.224 & P.228 


MICHAEL SENDTNER 


major goal for biomedical researchers 

has been to repair damaged cells and 

tissues by reinitiating the body’s devel- 
opmental mechanisms. In this respect, the use 
of embryonic stem cells — and of induced 
pluripotent stem cells, which are generated by 
reprogramming differentiated adult cells — 
has seemed promising. But these cells come 
with various problems, including ethical con- 
cerns, and potential tumorigenicity and rejec- 
tion by the host immune system. Three papers 
in this issue’* describe the direct conversion 
of neurons from skin fibroblast cells. There 
is hope, although no compelling evidence, 
that neurons generated in this way might be 
superior to those generated from induced 
pluripotent stem cells, thereby sidestepping 
the problems of using such cells*. 

The conversion of mouse fibroblasts into 
neurons was reported last year®. The same 
team (Pang et al.', page 220) now find, how- 
ever, that introducing the same three tran- 
scription factors (Brn2, Ascll/Mash1 and 
Myt1]) used in their original study into human 
fibroblasts is not sufficient to convert the cells 
into functional neurons; the resulting cells are 
immature, and even after 3 weeks in culture 
do not become functional, as judged by meas- 
urements of action potentials. Nonetheless, 
the authors show that the efficiency of human 


fibroblast conversion into neurons is enhanced 
by including an extra transcription factor, 
NeuroD1, a member of the bHLH family. 
Caiazzo et al.” (page 224) also set out to con- 
vert human fibroblasts into a specific type of 
neuron, in this case neurons of the midbrain 
that secrete the neurotransmitter dopamine. 
Starting their study in mice, they tested the 
efficiency of mixtures of transcription fac- 
tors (combinations of Ascl1, Brn2, Mytll and 
others) in the conversion process. The win- 
ning combination was that of Ascl1, Nurrl and 
Lmxla; Brn2 and Myt11 did not lead to dopa- 
mine-secreting neurons. The authors provide 
evidence that the conversion route was direct, 
and that the resulting neurons were function- 
ally surprisingly similar to primary dopamin- 
ergic neurons isolated from the brain. What’s 
more, when the converted dopaminergic cells 
were grafted into neonatal mouse brains, they 
became integrated and were functional. 
Caiazzo and colleagues then applied their 
method to human fibroblasts, including 
those from patients with Parkinson's disease, 
a neurodegenerative disorder in which dopa- 
minergic neurons are damaged. The conver- 
sion efficiency of adult human fibroblasts was 
10-20-fold lower than that of their mouse 
equivalents, and the resulting cells were less 
mature. This discrepancy points to species 
differences between mice and humans’. The 
authors’ observations fit well with those of 


158 | NATURE | VOL 476 | 11 AUGUST 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


as ethane, in distinguishing between the 
contributions from different sources. But more 
insight is needed to solve the enigma of the 
recent methane budget if the evolution of this 
important greenhouse gas over the twenty-first 
century is to be predicted. m 
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another team’ that recently reported the direct 
conversion of human fibroblasts into dopa- 
minergic cells using a different combination 
of transcription factors. 

Yoo et al.’ (page 228) approached the search 
for a suitable conversion method from a differ- 
ent angle. They show that combined expres- 
sion of miR-9/9* and miR-124 — two members 
of a class of short regulatory RNA sequences 
called microRNAs (miRNAs) — is sufficient 
to convert human fibroblasts into neurons. 
Again, the conversion process was greatly 
enhanced by the introduction of another 
bHLH-family transcription factor, NeuroD2. 
And increased expression of Ascll and Mytll 
further increased the conversion efficiency of 
neonatal human fibroblasts: about 80% of the 
resulting neurons showed action-potential- 
like activity. As in the other two studies, Yoo 
and colleagues write that adult fibroblasts were 
less amenable to conversion. 

Several common themes emerge. The stud- 
ies’ * highlight the importance of the bHLH- 
family transcription factors and the role of 
miRNAs in the conversion process. Moreover, 
they all show that the conversion of mouse 
and human fibroblasts into neurons requires 
different protocols, and that conversion of 
fibroblasts from adult or aged individuals is 
much less productive than that of embryonic 
or neonatal fibroblasts. 

The essential role of NeuroD1 and NeuroD2 
for efficient conversion correlates well with the 
crucial part played by these two transcription 
factors in neuron differentiation during normal 
brain development, and their reduced expres- 
sion as neurogenesis decreases during ageing”®. 
A relevant question is whether the degree of 
methylation of the genomic region contain- 
ing the genes for these transcription factors 
is higher in human than in mouse fibroblasts 
and increases during ageing, thereby resulting 
in their reduced expression. DNA methylation 
is a crucial regulator of gene expression. 

How miR-9/9* and miR-124 substitute for 
Brn2 — and possibly for Ascll and Myt1l — in 
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Surprisingly, Kai et al. find that a reduction 
in the fossil-fuel methane source is not com- 
patible with these measurements, and that 
the isotope record can be explained only by 
a reduction in the microbial sources in the 
Northern Hemisphere. A drying trend in 
northern wetlands could have contributed 
to this finding’, but the authors convinc- 
ingly show that methane emissions from rice 
agriculture, particularly in China, must also 
have decreased. Their conclusion is based on 
changes in agricultural practices: new high- 
yield rice species, together with greater appli- 
cation of fertilizer, require shorter inundation 
periods, making substantial water savings and 
reducing methane emissions. 

Can these conflicting inferences on the 
recent slow-down of global methane growth be 
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and simplistic analysis assumptions of the two 
studies'”, there are considerable uncertainties 
in the deduced methane-source variations, but 
the different scenarios are plausible and com- 
patible with their respective observations. The 
challenge now is to bring the different lines of 
evidence together, perhaps by using a more 
advanced modelling framework and improved 
bottom-up inventory information on the vari- 
ous methane-source categories. More extended 
observations will help too — particularly the 
mapping of atmospheric methane concentra- 
tion by current and upcoming satellite missions. 

These studies’” illustrate the importance 
of high-precision, long-term observations 
of methane concentration and isotope com- 
position, and of auxiliary trace gases such 
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opmental mechanisms. In this respect, the use 
of embryonic stem cells — and of induced 
pluripotent stem cells, which are generated by 
reprogramming differentiated adult cells — 
has seemed promising. But these cells come 
with various problems, including ethical con- 
cerns, and potential tumorigenicity and rejec- 
tion by the host immune system. Three papers 
in this issue’* describe the direct conversion 
of neurons from skin fibroblast cells. There 
is hope, although no compelling evidence, 
that neurons generated in this way might be 
superior to those generated from induced 
pluripotent stem cells, thereby sidestepping 
the problems of using such cells*. 

The conversion of mouse fibroblasts into 
neurons was reported last year®. The same 
team (Pang et al.', page 220) now find, how- 
ever, that introducing the same three tran- 
scription factors (Brn2, Ascll/Mash1 and 
Myt1]) used in their original study into human 
fibroblasts is not sufficient to convert the cells 
into functional neurons; the resulting cells are 
immature, and even after 3 weeks in culture 
do not become functional, as judged by meas- 
urements of action potentials. Nonetheless, 
the authors show that the efficiency of human 


fibroblast conversion into neurons is enhanced 
by including an extra transcription factor, 
NeuroD1, a member of the bHLH family. 
Caiazzo et al.” (page 224) also set out to con- 
vert human fibroblasts into a specific type of 
neuron, in this case neurons of the midbrain 
that secrete the neurotransmitter dopamine. 
Starting their study in mice, they tested the 
efficiency of mixtures of transcription fac- 
tors (combinations of Ascl1, Brn2, Mytll and 
others) in the conversion process. The win- 
ning combination was that of Ascl1, Nurrl and 
Lmxla; Brn2 and Myt11 did not lead to dopa- 
mine-secreting neurons. The authors provide 
evidence that the conversion route was direct, 
and that the resulting neurons were function- 
ally surprisingly similar to primary dopamin- 
ergic neurons isolated from the brain. What’s 
more, when the converted dopaminergic cells 
were grafted into neonatal mouse brains, they 
became integrated and were functional. 
Caiazzo and colleagues then applied their 
method to human fibroblasts, including 
those from patients with Parkinson's disease, 
a neurodegenerative disorder in which dopa- 
minergic neurons are damaged. The conver- 
sion efficiency of adult human fibroblasts was 
10-20-fold lower than that of their mouse 
equivalents, and the resulting cells were less 
mature. This discrepancy points to species 
differences between mice and humans’. The 
authors’ observations fit well with those of 
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as ethane, in distinguishing between the 
contributions from different sources. But more 
insight is needed to solve the enigma of the 
recent methane budget if the evolution of this 
important greenhouse gas over the twenty-first 
century is to be predicted. m 
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another team’ that recently reported the direct 
conversion of human fibroblasts into dopa- 
minergic cells using a different combination 
of transcription factors. 

Yoo et al.’ (page 228) approached the search 
for a suitable conversion method from a differ- 
ent angle. They show that combined expres- 
sion of miR-9/9* and miR-124 — two members 
of a class of short regulatory RNA sequences 
called microRNAs (miRNAs) — is sufficient 
to convert human fibroblasts into neurons. 
Again, the conversion process was greatly 
enhanced by the introduction of another 
bHLH-family transcription factor, NeuroD2. 
And increased expression of Ascll and Mytll 
further increased the conversion efficiency of 
neonatal human fibroblasts: about 80% of the 
resulting neurons showed action-potential- 
like activity. As in the other two studies, Yoo 
and colleagues write that adult fibroblasts were 
less amenable to conversion. 

Several common themes emerge. The stud- 
ies’ * highlight the importance of the bHLH- 
family transcription factors and the role of 
miRNAs in the conversion process. Moreover, 
they all show that the conversion of mouse 
and human fibroblasts into neurons requires 
different protocols, and that conversion of 
fibroblasts from adult or aged individuals is 
much less productive than that of embryonic 
or neonatal fibroblasts. 

The essential role of NeuroD1 and NeuroD2 
for efficient conversion correlates well with the 
crucial part played by these two transcription 
factors in neuron differentiation during normal 
brain development, and their reduced expres- 
sion as neurogenesis decreases during ageing”®. 
A relevant question is whether the degree of 
methylation of the genomic region contain- 
ing the genes for these transcription factors 
is higher in human than in mouse fibroblasts 
and increases during ageing, thereby resulting 
in their reduced expression. DNA methylation 
is a crucial regulator of gene expression. 

How miR-9/9* and miR-124 substitute for 
Brn2 — and possibly for Ascll and Myt1l — in 


the conversion process is not fully understood. 
Remodelling of chromatin (DNA-protein 
complexes), which in turn affects gene expres- 
sion, seems to have a central role. Yoo et al.* 
report that the two miRNAs modify the com- 
position of the BAF chromatin-remodelling 
complex by regulating expression of its sub- 
units, thereby transforming it into a chromatin 
remodeller that is characteristic of differenti- 
ated neurons’. But it remains unclear whether 
the specific chromatin structure in the con- 
verted neurons is the same as that in primary 
neurons. 

In this respect, Caiazzo and colleagues’ 
data’ are informative. They show that the 
gene-expression profiles of the dopaminergic 
neurons generated and those of isolated mouse 
midbrain dopaminergic neurons are similar 
but distinct: about 160 genes were expressed 
differently, with a more than fivefold variation 
in expression. This indicates that functional 
similarity of the converted neurons does not 
necessarily correspond to similar chromatin 
structure or gene-expression levels, and so 
cautions against the cells’ premature clini- 
cal use. Instead, the differences between the 
converted neurons and their corresponding 
primary neurons must be characterized, and 
whether they give rise to unwanted side effects 
should be explored. 

The miRNAs used by Yoo and co-workers’ 
probably also affect the expression of other 
proteins involved in cell-fate switching and 
neuron differentiation, including components 
of other chromatin-remodelling complexes. So 
before the clinical use of converted cells can be 
contemplated, further work should determine 
how the various chromatin-remodelling fac- 
tors, and other factors that affect gene expres- 
sion, contribute to cell conversion and how 
they can be controlled. Another question to 
be addressed is how the similarities and dif- 
ferences between converted cells and primary 
neurons, in terms of gene expression and 
chromatin structure, correlate with the func- 
tionality of the converted neurons — with 
their neurotransmitter production, firing of 
action potentials and functional integration 
into neuronal networks. It is also not known 
how gene expression and chromatin structure 
are shaped by intrinsic mechanisms and by the 
cells’ immediate environment, in particular 
after their transplantation. 

Thus, an area for further exploration is how 
a diseased brain’s environment influences the 
functionality and gene-expression profiles of 
the transplanted converted neurons. Whether 
the converted cells are transplanted into the 
supportive environment of a neonatal mouse 
brain or into a diseased or aged human brain 
probably makes a difference. The finding that 
DNA methylation is dynamic in postnatal 
neurons" raises hopes that the environment 
into which these converted cells are trans- 
ferred could contribute to correctly shaping 
their gene-expression profile for long-term 


integration and function. However, the diseased 
brain in aged humans might not be so good at 
doing this. Therefore, investigating strategies to 
compensate for this deficit is perhaps the next 
big challenge in developing cell-based therapies 
for neurodegenerative disorders. m 
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The chemistry inside 


To understand the properties of many useful materials, the chemical structures 
that form within them from elements of low relative atomic mass must be 
determined. A new X-ray imaging technique does just that. 


CHRISTIAN G. SCHROER 


ne of the most striking properties 

of X-rays is their ability to penetrate 

matter. In fact, X-rays are partially 
absorbed as they pass through an object, and 
cast a ‘shadow’ of the structures inside — an 
effect that underpins medical X-ray imaging. 
As objects become bigger, harder (more ener- 
getic) X-rays are needed to penetrate them. 
The problem is that the harder the X-rays, 
the smaller the imaging contrast for light ele- 
ments — those that have low relative atomic 
masses, such as carbon, nitrogen and oxygen. 
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Reporting in Nature Materials, Huotari et al. 
now describe a way of using hard X-rays to 
make three-dimensional images of objects that 
also detects light elements with high sensitiv- 
ity. The technique even distinguishes between 
different bonding modes of those elements, 
and should enable studies of the interiors of 
diverse objects, such as biological systems, 
working catalytic reactors and fuel cells. 
When a crimson rose is illuminated by 
sunlight, it appears red to us even though the 
light contains all the colours of the rainbow. 
This is because the different colours in the 
light are absorbed and scattered differently 


Figure 1 | X-ray scattering for three-dimensional imaging. a, Huotari et al.' report a technique for 
imaging samples of materials that contain light elements such as carbon, nitrogen and oxygen. A 
cross-section of a sample is illuminated by a monochromatic, two-dimensional X-ray beam. X-ray 
scattering occurs, which can alter the energy of the scattered X-rays. A spherically curved crystal filters 
the scattered X-rays by reflecting only those of a certain pre-set energy, and focuses them onto a detector 
to form an image of the cross-section. Three-dimensional images are obtained by scanning samples 
through the incident beam. b, The authors’ technique can distinguish between different chemical 
bonding modes in samples. This image shows part of a sample in which a diamond is encased by graphite. 
Regions containing diamond carbon atoms are shown in red, and those containing graphite carbon atoms 


are in blue. (Figure adapted from ref. 1.) 
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the conversion process is not fully understood. 
Remodelling of chromatin (DNA-protein 
complexes), which in turn affects gene expres- 
sion, seems to have a central role. Yoo et al.* 
report that the two miRNAs modify the com- 
position of the BAF chromatin-remodelling 
complex by regulating expression of its sub- 
units, thereby transforming it into a chromatin 
remodeller that is characteristic of differenti- 
ated neurons’. But it remains unclear whether 
the specific chromatin structure in the con- 
verted neurons is the same as that in primary 
neurons. 

In this respect, Caiazzo and colleagues’ 
data’ are informative. They show that the 
gene-expression profiles of the dopaminergic 
neurons generated and those of isolated mouse 
midbrain dopaminergic neurons are similar 
but distinct: about 160 genes were expressed 
differently, with a more than fivefold variation 
in expression. This indicates that functional 
similarity of the converted neurons does not 
necessarily correspond to similar chromatin 
structure or gene-expression levels, and so 
cautions against the cells’ premature clini- 
cal use. Instead, the differences between the 
converted neurons and their corresponding 
primary neurons must be characterized, and 
whether they give rise to unwanted side effects 
should be explored. 

The miRNAs used by Yoo and co-workers’ 
probably also affect the expression of other 
proteins involved in cell-fate switching and 
neuron differentiation, including components 
of other chromatin-remodelling complexes. So 
before the clinical use of converted cells can be 
contemplated, further work should determine 
how the various chromatin-remodelling fac- 
tors, and other factors that affect gene expres- 
sion, contribute to cell conversion and how 
they can be controlled. Another question to 
be addressed is how the similarities and dif- 
ferences between converted cells and primary 
neurons, in terms of gene expression and 
chromatin structure, correlate with the func- 
tionality of the converted neurons — with 
their neurotransmitter production, firing of 
action potentials and functional integration 
into neuronal networks. It is also not known 
how gene expression and chromatin structure 
are shaped by intrinsic mechanisms and by the 
cells’ immediate environment, in particular 
after their transplantation. 

Thus, an area for further exploration is how 
a diseased brain’s environment influences the 
functionality and gene-expression profiles of 
the transplanted converted neurons. Whether 
the converted cells are transplanted into the 
supportive environment of a neonatal mouse 
brain or into a diseased or aged human brain 
probably makes a difference. The finding that 
DNA methylation is dynamic in postnatal 
neurons" raises hopes that the environment 
into which these converted cells are trans- 
ferred could contribute to correctly shaping 
their gene-expression profile for long-term 


integration and function. However, the diseased 
brain in aged humans might not be so good at 
doing this. Therefore, investigating strategies to 
compensate for this deficit is perhaps the next 
big challenge in developing cell-based therapies 
for neurodegenerative disorders. m 
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To understand the properties of many useful materials, the chemical structures 
that form within them from elements of low relative atomic mass must be 
determined. A new X-ray imaging technique does just that. 
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ne of the most striking properties 

of X-rays is their ability to penetrate 

matter. In fact, X-rays are partially 
absorbed as they pass through an object, and 
cast a ‘shadow’ of the structures inside — an 
effect that underpins medical X-ray imaging. 
As objects become bigger, harder (more ener- 
getic) X-rays are needed to penetrate them. 
The problem is that the harder the X-rays, 
the smaller the imaging contrast for light ele- 
ments — those that have low relative atomic 
masses, such as carbon, nitrogen and oxygen. 
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Reporting in Nature Materials, Huotari et al. 
now describe a way of using hard X-rays to 
make three-dimensional images of objects that 
also detects light elements with high sensitiv- 
ity. The technique even distinguishes between 
different bonding modes of those elements, 
and should enable studies of the interiors of 
diverse objects, such as biological systems, 
working catalytic reactors and fuel cells. 
When a crimson rose is illuminated by 
sunlight, it appears red to us even though the 
light contains all the colours of the rainbow. 
This is because the different colours in the 
light are absorbed and scattered differently 


Figure 1 | X-ray scattering for three-dimensional imaging. a, Huotari et al.' report a technique for 
imaging samples of materials that contain light elements such as carbon, nitrogen and oxygen. A 
cross-section of a sample is illuminated by a monochromatic, two-dimensional X-ray beam. X-ray 
scattering occurs, which can alter the energy of the scattered X-rays. A spherically curved crystal filters 
the scattered X-rays by reflecting only those of a certain pre-set energy, and focuses them onto a detector 
to form an image of the cross-section. Three-dimensional images are obtained by scanning samples 
through the incident beam. b, The authors’ technique can distinguish between different chemical 
bonding modes in samples. This image shows part of a sample in which a diamond is encased by graphite. 
Regions containing diamond carbon atoms are shown in red, and those containing graphite carbon atoms 


are in blue. (Figure adapted from ref. 1.) 
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by molecules in the petals, which effectively 
reflect only the red part of the spectrum. Mod- 
ern spectroscopic techniques make use of this 
effect to determine the chemical composition 
and structure of an object. In photoabsorp- 
tion spectroscopy, visible or ultraviolet light, 
or X-rays of a given energy, are shone onto 
an object, and the amount of light absorbed 
is measured as the radiation passes through’. 
When the energy (colour) of the radiation is 
varied, the transmittance — a measure of the 
amount of light that passes through a sample 
— changes in a way that is characteristic of the 
atoms in the sample and their chemical bonds 
to neighbouring atoms. In this way, detailed 
information about the chemical environments 
in an object can be obtained. 

Carbon, nitrogen and oxygen are particularly 
abundant in both natural and technologically 
relevant materials. In order to understand the 
properties of these materials, it is important 
to determine the diverse chemical structures 
that form within them. But to investigate these 
structures using photoabsorption spectroscopy, 
for example, soft X-ray light must be used. Most 
materials are rather opaque to this part of the 
electromagnetic spectrum — even in water, soft 
X-rays travel the distance of less than a hair’s 
breadth. It is therefore difficult to use this tech- 
nique to investigate thicker materials such as 
biological tissues, or the chemistry in the vicin- 
ity of a battery electrode. Analogous electron- 
based spectroscopic methods could in principle 
also be used, but electrons penetrate even less 
deeply into materials than do soft X-rays. 

Huotari et al." have addressed these difficul- 
ties by using hard X-rays, which easily pen- 
etrate large sample volumes and a variety of 
specialized sample environments. In particu- 
lar, they have used hard X-rays in a technique 
called inelastic X-ray scattering spectroscopy’. 
In their system, a sample is partly illuminated 
by a sheet-like, hard-X-ray beam of fixed 
energy (Fig. la). The X-rays are scattered by 
the atomic structures in the illuminated slice 
of the sample. The scattered X-ray light is then 
reflected by a spherically curved crystal, which 
focuses a two-dimensional image of the slice 
onto a detector. Three-dimensional images 
are obtained by stepping a sample through the 
beam and assembling the two-dimensional 
images that are acquired at each step. 

In addition to focusing images of samples 
onto the detector, the curved crystal has a sec- 
ond vital function: because its reflectivity is 
strongly selective, it reflects and images only 
scattered light of a certain energy onto the 
detector. For example, by setting up the system 
so that only light that has the same energy as 
the incident beam is reflected, an image can 
be formed using elastically scattered X-rays 
— those that are scattered without transfer- 
ring part of their energy to the sample. Such 
an image depicts the static (time-averaged) 
structure of the sample. Similarly, the energy 
reflected by the crystal can be set to a given 


energy transfer (the difference in energy of inci- 
dent and scattered X-rays). This allows imaging 
of inelastically scattered X-rays — those that 
transfer part of their energy to the sample, 
for example by exciting vibrations, valence 
electrons or collective motions of electrons. 

Crucially for Huotari and colleagues, hard 
X-rays can also transfer energy to atoms of 
light elements in excitations similar to those 
that occur in photoabsorption spectroscopy. 
The resulting inelastic-scattering spectrum 
thus reveals the local chemical structure of 
those atoms in a material. So, by tuning the 
crystal in their experimental set-up to reflect 
scattered X-rays at transfer energies corre- 
sponding to such excitations, the authors could 
obtain images that reveal details of the chemi- 
cal bonds of a given light element in samples. 

To illustrate the strength of this approach, 
Huotari et al. analysed a test sample consisting 
of diamond enclosed by graphite — two allo- 
tropes of carbon in which the chemical bond- 
ing between atoms is different. Using their 
technique, the authors were able to map out 
clearly the positions of the different allotropes 
in the sample, effectively locating the diamond 
and graphite components on the basis of their 
bond structures (Fig. 1b). No other technique 
could do this, although the diamond could 
perhaps have been detected using classical 
absorption imaging because diamond is denser 
than graphite. But the classical approach would 
provide no information about the chemical 
structures of the allotropes. 

Because inelastically scattered X-ray 


GLOBAL CHANGE 


signals are typically weak, the authors method 
requires the brightest available X-ray sources. 
This currently restricts the use of the technique 
to researchers with access to modern synchro- 
tron radiation sources. Another limitation is 
the spatial resolution of the technique — the 
authors studied millimetre-sized samples, 
but it would be desirable to use the method to 
study microscopic samples. 

Nevertheless, the ability to characterize the 
chemical bonding of light elements inside 
thick samples, and of samples in specialized 
environments (such as in chemical reactors 
or pressurized cells) opens up many applica- 
tions. For example, one could use the authors’ 
method to visualize local bond structure in 
composite materials, or to identify different 
phases in polymer blends. It could also be used 
to study the reactions of carbon and oxygen in 
industrially useful catalytic reactions, such as 
the catalytic partial oxidation of methane* — 
one of the steps involved in the conversion of 
natural gas into liquid fuels. m 


Christian G. Schroer is at the Technische 
Universitit Dresden, Institute of Structural 
Physics, Dresden 01069, Germany. 

e-mail: schroer@physik.tu-dresden.de 


1. Huotari, S., Pylkkanen, T., Verbeni, R., Monaco, G. & 
Hamilainen, K. Nature Mater. 10, 489-493 (2011). 

2. Koningsberger, D. C. & Prins, R. (eds) X-Ray 
Absorption (Wiley, 1988). 

3. Schiilke, W. Electron Dynamics by Inelastic X-Ray 
Scattering (Oxford Univ. Press, 2007). 

4. Grunwaldt, J.-D. & Schroer, C. G. Chem. Soc. Rev. 39, 
4741-4753 (2010). 


The grass response 


A three-year study provides insights into how the productivity of a semi-arid 
rangeland, containing grasses using different photosynthetic pathways, will change 
in a warmer world with more atmospheric carbon dioxide. SEE LETTER P.202 


DENNIS BALDOCCHI 


he grasses that provide forage for most 

of the world’s livestock use either the C, 

or the C, photosynthetic routes to fix 
carbon dioxide into carbohydrates. Both routes 
eventually employ the carbon-fixing enzyme 
ribulose bisphosphate carboxylase. However, 
C, grasses are more efficient photosynthesiz- 
ers as a result of their distinct morphology and 
their use of another enzyme, PeP-carboxylase, 
before this ultimate step of carbon fixation. On 
page 202 of this issue, Morgan et al.' describe 
experiments aimed at assessing the effects 
on mixed C,/C, grasslands of raised levels of 
atmospheric CO, and higher temperatures, 
and, crucially, how these conditions might 
change the plants’ water budgets. 
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As any high-school science student or 
climate-change sceptic knows, CO, is food for 
plants. So the presumption that ‘more is better; 
as we release CO, at an undiminished rate from 
fossil-fuel combustion, is partly true. This effect 
occurs because photosynthesis is stimulated 
when leaves are exposed to above-ambient 
levels of CO,. But the rate of increase in photo- 
synthesis with rising CO, will eventually 
diminish and approach an asymptote as CO, 
levels reach double that of the present day”. 

How other factors affecting leaf photosyn- 
thesis and its conversion of carbohydrates 
into plant matter respond to elevated CO, 
depends on a variety of conditions: for exam- 
ple, whether plants are growing in isolation 
or in groups’; are woody or herbaceous”; 
are growing as monocultures or as mixed 
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by molecules in the petals, which effectively 
reflect only the red part of the spectrum. Mod- 
ern spectroscopic techniques make use of this 
effect to determine the chemical composition 
and structure of an object. In photoabsorp- 
tion spectroscopy, visible or ultraviolet light, 
or X-rays of a given energy, are shone onto 
an object, and the amount of light absorbed 
is measured as the radiation passes through’. 
When the energy (colour) of the radiation is 
varied, the transmittance — a measure of the 
amount of light that passes through a sample 
— changes in a way that is characteristic of the 
atoms in the sample and their chemical bonds 
to neighbouring atoms. In this way, detailed 
information about the chemical environments 
in an object can be obtained. 

Carbon, nitrogen and oxygen are particularly 
abundant in both natural and technologically 
relevant materials. In order to understand the 
properties of these materials, it is important 
to determine the diverse chemical structures 
that form within them. But to investigate these 
structures using photoabsorption spectroscopy, 
for example, soft X-ray light must be used. Most 
materials are rather opaque to this part of the 
electromagnetic spectrum — even in water, soft 
X-rays travel the distance of less than a hair’s 
breadth. It is therefore difficult to use this tech- 
nique to investigate thicker materials such as 
biological tissues, or the chemistry in the vicin- 
ity of a battery electrode. Analogous electron- 
based spectroscopic methods could in principle 
also be used, but electrons penetrate even less 
deeply into materials than do soft X-rays. 

Huotari et al." have addressed these difficul- 
ties by using hard X-rays, which easily pen- 
etrate large sample volumes and a variety of 
specialized sample environments. In particu- 
lar, they have used hard X-rays in a technique 
called inelastic X-ray scattering spectroscopy’. 
In their system, a sample is partly illuminated 
by a sheet-like, hard-X-ray beam of fixed 
energy (Fig. la). The X-rays are scattered by 
the atomic structures in the illuminated slice 
of the sample. The scattered X-ray light is then 
reflected by a spherically curved crystal, which 
focuses a two-dimensional image of the slice 
onto a detector. Three-dimensional images 
are obtained by stepping a sample through the 
beam and assembling the two-dimensional 
images that are acquired at each step. 

In addition to focusing images of samples 
onto the detector, the curved crystal has a sec- 
ond vital function: because its reflectivity is 
strongly selective, it reflects and images only 
scattered light of a certain energy onto the 
detector. For example, by setting up the system 
so that only light that has the same energy as 
the incident beam is reflected, an image can 
be formed using elastically scattered X-rays 
— those that are scattered without transfer- 
ring part of their energy to the sample. Such 
an image depicts the static (time-averaged) 
structure of the sample. Similarly, the energy 
reflected by the crystal can be set to a given 


energy transfer (the difference in energy of inci- 
dent and scattered X-rays). This allows imaging 
of inelastically scattered X-rays — those that 
transfer part of their energy to the sample, 
for example by exciting vibrations, valence 
electrons or collective motions of electrons. 

Crucially for Huotari and colleagues, hard 
X-rays can also transfer energy to atoms of 
light elements in excitations similar to those 
that occur in photoabsorption spectroscopy. 
The resulting inelastic-scattering spectrum 
thus reveals the local chemical structure of 
those atoms in a material. So, by tuning the 
crystal in their experimental set-up to reflect 
scattered X-rays at transfer energies corre- 
sponding to such excitations, the authors could 
obtain images that reveal details of the chemi- 
cal bonds of a given light element in samples. 

To illustrate the strength of this approach, 
Huotari et al. analysed a test sample consisting 
of diamond enclosed by graphite — two allo- 
tropes of carbon in which the chemical bond- 
ing between atoms is different. Using their 
technique, the authors were able to map out 
clearly the positions of the different allotropes 
in the sample, effectively locating the diamond 
and graphite components on the basis of their 
bond structures (Fig. 1b). No other technique 
could do this, although the diamond could 
perhaps have been detected using classical 
absorption imaging because diamond is denser 
than graphite. But the classical approach would 
provide no information about the chemical 
structures of the allotropes. 

Because inelastically scattered X-ray 
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signals are typically weak, the authors method 
requires the brightest available X-ray sources. 
This currently restricts the use of the technique 
to researchers with access to modern synchro- 
tron radiation sources. Another limitation is 
the spatial resolution of the technique — the 
authors studied millimetre-sized samples, 
but it would be desirable to use the method to 
study microscopic samples. 

Nevertheless, the ability to characterize the 
chemical bonding of light elements inside 
thick samples, and of samples in specialized 
environments (such as in chemical reactors 
or pressurized cells) opens up many applica- 
tions. For example, one could use the authors’ 
method to visualize local bond structure in 
composite materials, or to identify different 
phases in polymer blends. It could also be used 
to study the reactions of carbon and oxygen in 
industrially useful catalytic reactions, such as 
the catalytic partial oxidation of methane* — 
one of the steps involved in the conversion of 
natural gas into liquid fuels. m 
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The grass response 


A three-year study provides insights into how the productivity of a semi-arid 
rangeland, containing grasses using different photosynthetic pathways, will change 
in a warmer world with more atmospheric carbon dioxide. SEE LETTER P.202 


DENNIS BALDOCCHI 


he grasses that provide forage for most 

of the world’s livestock use either the C, 

or the C, photosynthetic routes to fix 
carbon dioxide into carbohydrates. Both routes 
eventually employ the carbon-fixing enzyme 
ribulose bisphosphate carboxylase. However, 
C, grasses are more efficient photosynthesiz- 
ers as a result of their distinct morphology and 
their use of another enzyme, PeP-carboxylase, 
before this ultimate step of carbon fixation. On 
page 202 of this issue, Morgan et al.' describe 
experiments aimed at assessing the effects 
on mixed C,/C, grasslands of raised levels of 
atmospheric CO, and higher temperatures, 
and, crucially, how these conditions might 
change the plants’ water budgets. 
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As any high-school science student or 
climate-change sceptic knows, CO, is food for 
plants. So the presumption that ‘more is better; 
as we release CO, at an undiminished rate from 
fossil-fuel combustion, is partly true. This effect 
occurs because photosynthesis is stimulated 
when leaves are exposed to above-ambient 
levels of CO,. But the rate of increase in photo- 
synthesis with rising CO, will eventually 
diminish and approach an asymptote as CO, 
levels reach double that of the present day”. 

How other factors affecting leaf photosyn- 
thesis and its conversion of carbohydrates 
into plant matter respond to elevated CO, 
depends on a variety of conditions: for exam- 
ple, whether plants are growing in isolation 
or in groups’; are woody or herbaceous”; 
are growing as monocultures or as mixed 


species*”; are growing under natural or 
managed conditions* °; utilize the C, or the C, 
photosynthetic pathway””; are evaluated after 
the course of a growing season or after many 
years*”*; or are exposed to a warmer or drier 
climate’. 

Morgan et al.’ disentangle part of this web 
of potentially interacting factors by presenting 
convincing evidence that, compared with C, 
grasses, C, grasses growing under semi-arid 
climatic conditions will prosper in a world 
with higher CO, concentrations and warmer 
temperatures. In principle, this response 
depends on water — the leaf pores, known 
as stomata, of C, plants open less widely than 
those of C, plants, so C, plants tend to lose less 
water vapour through transpiration. 

Morgan and colleagues’ take-home message 
is the following: water savings proffered to 
C, grasses from partial stomatal closure, 
induced by high CO, concentrations, offset 
any enhancement in evaporation brought 
about by coincident warming. They draw this 
conclusion from a novel, multifactorial experi- 
ment that exposed plots of C; and C, grasses 
to combinations of ambient or elevated CO, 
and/or temperature for three years. For the sin- 
gle-factor plots, they found that elevated CO, 
favours the productivity of C; grasses and that 
elevated temperature favours that of C, grasses, 
as expected’, However, a combination of ele- 
vated CO, and higher temperature favours 
increased C, productivity because water-use 
efficiency (the ratio between photosynthesis 
and transpiration) in C, grasses is greater than 
in C, grasses. 

Thousands of papers and hundreds of 
reviews‘ have been published on how plants 
respond to elevated CO,. So why is this 
particular study noteworthy? 

First, Morgan and colleagues’ provide one of 
the first and best views of how a mixed-grass 
ecosystem (Fig. 1) growing in a semi-arid 
climate will respond to future CO, and climatic 
conditions. The task of divining this informa- 
tion is not trivial, because the perturbation of 
an ecosystem engenders a suite of nonlinear, 
positive and negative feedbacks that operate 
across an array of timescales*. Second, their 
data counter the conventional wisdom derived 
from physiological studies that leaf photosyn- 
thesis and water-use efficiency in C, grasses 
benefit less from high CO, than in C; grasses’. 
Third, the interactive effects of CO,, tempera- 
ture and soil moisture on C,; and C, grasses 
have until now remained elusive. Without the 
warming treatment, Morgan and colleagues’ 
results would only have confirmed knowl- 
edge gained from previous meta-analyses that 
compared the responses of C, and C; grasses to 
raised CO, — both C; and C, plants respond 
positively to enhanced CO, (ref. 7). 

However, users of this new information! 
must appreciate the conditional aspects 
of Morgan and colleagues’ results. Short- 
term studies of elevated CO,, over single 
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Figure 1 | Mixed-grass prairie. This grassland in Wyoming is covered by a variety of plants that use 
either C, or C, photosynthetic routes, and is the kind of ecosystem studied by Morgan and colleagues’. 


to several growing seasons, manipulate the 
ecosystem’s eco-physiological ‘knobs, such as 
stomatal conductance, leaf-area index and the 
soil—water balance. In such circumstances, 
elevated CO, will indeed cause stomata to 
close partially, restricting leaf transpiration 
and conserving soil moisture. But enhanced 
growth increases the population of leaves and 
so counteracts potential soil-moisture savings 
— up toa limit. 

Whether this net moisture saving is critical 
to plant productivity depends on how much 
rain falls during the growing season. In a more 
humid environment, the water saving through 
CO,-induced stomatal closure in C, plants may 
be less consequential, as shown in 2009 when 
precipitation at Morgan and colleagues’ test 
site was greater than normal’. If there is severe 
drought, however, a rise in CO, may not pro- 
vide enough water savings to make a differ- 
ence. When warming is added to the mix, it 
can counter the leaf’s physiological response 
to elevated CO,. Concurrent warming pro- 
motes transpiration by increasing the gradi- 
ent in humidity between the leaf surface and 
the atmosphere. Warming also increases rates 
of leaf and root respiration, thereby diminish- 
ing net carbon acquisition and reducing plant 
growth. Consequently, the answer to the ques- 
tion, ‘How will this mixed-grass ecosystem 
respond to changes in mean CO,?’ will depend, 
in part, on whether or not climatic extremes in 
the moisture and thermal environments occur 
and physiological thresholds are crossed. 

In contrast to short-term experiments, stud- 
ies lasting longer than a decade manipulate 
biogeochemical and ecological knobs such as 
leaf and root nutrition, species composition 
and plant competition. Downregulation of 
photosynthesis typically occurs in long-term 
studies of elevated CO,, because the nitrogen 
needed for ribulose bisphosphate carboxylase 
becomes scarcer as plant biomass expands and 


sequesters the limited supply of nitrogen avail- 
able to the plants’ roots”**. These longer-term 
responses are not captured in Morgan and col- 
leagues’ three-year study. Hence, additional 
years of studying this ecosystem may produce 
alternative outcomes. Moreover, findings that 
apply to mixed C,/C, grasslands may not relate 
to other ecosystems such as forests’. 

What do these results' suggest for the future? 
At present, C, grasses tend to inhabit warmer 
and drier climates than do C, grasses””. If the 
trends in atmospheric warming and increas- 
ing CO, persist over decades, a gradual shift 
in species composition may occur, favouring 
C, grasses. Any change in the mixture of C, 
and C, grasses will have a biophysical feed- 
back on the climate system by changing how 
solar energy is absorbed and partitioned into 
heating the air and evaporating soil moisture’. 
Such changes in species composition may also 
affect ecosystem services — for example, the 
amount and availability of forage for grazing 
cattle, bison and antelope. m 
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A continuum model for tumour 


suppression 


Alice H. Berger', Alfred G. Knudson? & Pier Paolo Pandolfi' 


This year, 2011, marks the forty-year anniversary of the statistical analysis of retinoblastoma that provided the first 
evidence that tumorigenesis can be initiated by as few as two mutations. This work provided the foundation for the 
two-hit hypothesis that explained the role of recessive tumour suppressor genes (TSGs) in dominantly inherited cancer 
susceptibility syndromes. However, four decades later, it is now known that even partial inactivation of tumour 
suppressors can critically contribute to tumorigenesis. Here we analyse this evidence and propose a continuum 
model of TSG function to explain the full range of TSG mutations found in cancer. 


1900, it was only then, after the rediscovery of Gregor Mendel’s 

once-ignored nineteenth century work, that hereditary pre- 
disposition to cancer could be rationalized. By then it was also known 
that the pattern of chromosomes in cancer cells is abnormal. The next 
contribution to understanding cancer genetics was made by Theodor 
Boveri, who proposed that some chromosomes might stimulate cell 
division and others might inhibit it, but his idea was long overlooked. 
Now we know that there are genes of both types. 

In this review, we summarize the history of the study of the latter type 
of genes, tumour suppressor genes (TSGs), and the evidence supporting 
a role for both complete and partial tumour suppressor inactivation in 
the pathogenesis of cancer. We integrate the classical ‘two-hit’ hypo- 
thesis of tumour suppression with a continuum model that accounts for 
subtle dosage effects of tumour suppressors, and we discuss other excep- 
tions to the two-hit hypothesis such as the emerging concept of ‘obligate 
haploinsufficiency’ in which partial loss of a TSG is more tumorigenic 
than complete loss. The continuum model highlights the importance of 
subtle regulation of TSG expression or activity such as regulation by 
microRNAs (miRNAs). Finally, we discuss the implications of this 
model for the diagnosis and therapy of cancer. 


A Ithough hereditary predisposition to cancer was known before 


The two-hit hypothesis 
The first evidence for a genetic abnormality as a cause of cancer came with 
the discovery, in 1960, of the abnormal ‘Philadelphia’ chromosome in 
chronic myelogenous leukaemia cells’. Later, in 1973, it was discovered 
that this chromosome was a translocation between chromosomes 9 and 22 
(ref. 2) and in 1977, a translocation of chromosomes 15 and 17 was iden- 
tified in acute promyelocytic leukaemia’. Eventually, the genes at the break- 
points of these translocations were cloned: BCR/ABL1 in 1983 (ref. 4) and 
PML/RARA in 1991 (ref. 5). Meanwhile, seminal work demonstrated that 
the cancer-causing gene within the avian sarcoma virus genome, v-src, was 
actually a co-opted and mutated version of a normal cellular gene termed a 
proto-oncogene, now called c-src*’. These observations demonstrated that 
normal cellular genes, when mutated or altered, are able to cause cancer, 
and were followed by the identification of numerous other cellular onco- 
genes that are activated by mutation, chromosome translocation or amp- 
lification. One of Boveri’s predictions was proven to be correct. 

In the 1980s, scientists verified Boveri’s other prediction with the iden- 
tification of a second class of genes involved in cancer, tumour suppressor 


genes, that inhibit cancer development and oppose oncogene function. In 
a normal cell, a physiological balance between tumour suppressors and 
oncogenes maintains homeostasis and allows carefully regulated cell pro- 
liferation without unrestrained malignant tumour growth. Somatic cell 
fusion experiments pointed to the existence of such TSGs, because fusion 
of a normal cell with a malignant cell could revert the malignant cell to a 
normal phenotype’. One paradox that puzzled cancer researchers was that 
cancer susceptibility syndromes usually display a dominant mode of 
inheritance, whereas the proposed tumour suppressor genes seemed to 
function in a recessive manner in the in vitro cell fusion experiments. 

The resolution of this problem was provided by the analysis of a 
tumour of children, retinoblastoma, which is sometimes present even 
at birth’. Statistical modelling indicated that hereditary cases probably 
developed after only one somatic mutational event. It was proposed that 
one mutant allele is inherited and the other is generated somatically 
during growth of the developing eye. Because of the high chance of 
the second ‘hit’ occurring, almost all individuals with the inherited first 
hit (the mutant allele) will develop retinoblastoma, and therefore the 
cancer-susceptibility phenotype is inherited in a dominant manner 
(Fig. 1). In contrast, tumour initiation requires both hits, and therefore 
tumorigenesis is recessive. A similar mechanism could be extended to 
sporadic retinoblastoma, the only difference being that in the sporadic 
cases both the first and second hits occur in somatic tissue. At the time of 
the original analysis, it was not known whether these two hits occurred 
in the same or distinct genes, but it was clear that the search for the 
secondary mutation would start with the disease gene itself. 

The success of the two-hit model came with the identification of 
deletion of a locus on chromosome 13 in retinoblastoma tumours and 
the subsequent cloning of the RBI gene as mutated in familial retino- 
blastoma’®”’’. This discovery was made possible by the development of 
restriction fragment length polymorphism (RFLP) mapping technology. 
Using this approach, it was shown that many families with familial 
retinoblastoma did indeed harbour mutations or deletions of RBI in 
the germ line (the first hit), and tumours from retinoblastoma patients 
nearly always contained mutation or loss of the other RB1 allele’°"”’ (the 
second hit). This latter event most often results from somatic mitotic 
recombination, whereas in a minority of cases, the normal allele acquires 
a distinct mutation. The role of recessive TSGs in dominantly inherited 
cancer susceptibility syndromes could then be understood, and Boveri's 
second prediction was substantiated. 
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Two-hit paradigm 
(RB) (p53) 
Figure 1 | Paradigms of tumour suppression. A black gradient represents a 
continuum of expression related to the number of alleles present (grey 
numbering). Left, the two-hit paradigm as exemplified by the tumour 
suppressor, RB. Loss of one allele induces cancer susceptibility; loss of two 
alleles induces cancer. Middle, classical haploinsufficiency. Loss of one allele is 
sufficient for induction of cancer. Right, quasi-sufficiency and obligate 


These findings and others not only established the two-hit hypothesis 
as a model that explained hereditary cancer susceptibility, but also 
provided a proof-of-principle that TSGs could be identified by the study 
of chromosomal deletions and genetic linkage in hereditary cases of a 
cancer. Such a strategy was critical to the ultimate identification of TP53 
(also called p53) as a TSG** and was used successfully in the identifica- 
tion of other TSGs such as APC’’, BRCA1 (ref. 20) and BRCA2 (ref. 21). 
Now the advent of high-throughput sequencing and genome-wide copy 
number profiling is allowing further systematic identification of regions 
of mutation and deletion in tumour genomes”*”’. 

It should be noted that it was initially proposed that two hits would be 
sufficient for the initiation of some tumours such as paediatric cancers, but 
additional mutations would be necessary for others. It is now thought that 
non-hereditary cancers require approximately four or more distinct 
mutation events that result in perturbation of critical cellular signalling 
pathways such as phosphoinositide-3-kinase (PI(3)K), p53 or RB”. 
Malignancy is thought to result from an iterative process of somatic 
mutation followed by clonal expansion’*. In the context of hereditary 
cancer syndromes, full inactivation of the involved disease gene may be 
the rate-limiting step for tumour initiation, but other events may occur, 
particularly to promote or enhance tumour progression. In this respect, 
the two-hit hypothesis should be applied directly to the TSG, with two hits 
representing the number of events necessary for inactivation of the TSG, 
but not necessarily for tumorigenesis. Haploinsufficiency, as we discuss 
below, must also be interpreted in this context; in many cases, a gene might 
be haploinsufficient for a specific cellular function, but TSG haploinsuffi- 
ciency may promote tumour formation only in the context of other 
genetic or environmental insults. 


Haploinsufficiency and quasi-sufficiency 
The two-hit hypothesis clearly explained the differences in tumour number 
and age of onset of retinoblastoma in hereditary versus sporadic cases of 
retinoblastoma. Moreover, it led to the eventual cloning of RB1 as the first 
bona fide human TSG. The hypothesis has been useful in cloning genes for 
hereditary cancers and for their non-hereditary counterparts. However, 
this success has led to a questionable dogma in the field that all important 
tumour suppressors should behave according to the two-hit model. 
Investigators sought to apply the principles learned from the two-hit 
model to sporadic cancers in order to identify novel TSGs. Many recur- 
ring regions of chromosomal deletion have been identified in sporadic 
cancers, indicative of the presence of a TSG. Unfortunately, it has 
become apparent that not all regions of consistent loss are accompanied 
by obvious aberrations on the other allele, raising the question of 


164 | NATURE | VOL 476 | 11 AUGUST 2011 


Haploinsufficiency 


Quasi-sufficiency + 
obligate haploinsufficiency 
(PTEN) 


haploinsufficiency. Quasi-sufficiency refers to the phenomenon whereby 
tumour suppression is impaired after subtle expression downregulation 
without loss of even one allele. Obligate haploinsufficiency occurs when TSG 
haploinsufficiency is more tumorigenic than complete loss of the TSG, usually 
due to the activation of fail-safe mechanisms following complete loss of TSG 
expression. 


whether these deletions may represent ‘passenger’ events not relevant 
to carcinogenesis. Indeed, very few of the identified regions of chromo- 
somal losses have yielded clear TSGs in those loci that conform to the 
two-hit hypothesis. This finding has led many to conclude that the 
relevant tumour suppressors in those regions have not been identified 
or do not exist, when in fact many genes in those loci are known to have 
tumour suppressive properties in vitro or in vivo. 

Alternatively, these regions could harbour bona fide TSGs but may 
represent regions/genes where single-copy mutation or loss has a role in 
tumorigenesis. These single-copy events may even be selected for during 
tumorigenesis instead of biallelic TSG loss. One possibility is that the 
initial lesion, when reduced to homozygosity, may lead to cell death or 
senescence. The lethality could be due to homozygosity of the disease 
gene itself or of other distinct genes included in the initial hit, because 
recombination is a principal second hit in tumorigenesis”*. Such a scenario 
would result in an obligate haploinsufficiency in which selection pressure 
during tumorigenesis favours partial, but not complete, loss of the TSG. 

A second possibility is that the single-copy mutation of a TSG func- 
tions as a dominant-negative mutation towards the wild-type gene/ 
protein. After mutation of one allele, the mutant protein product inter- 
feres with the normal wild-type protein produced from the remaining 
wild-type allele. Because the complete normal function of the TSG is 
already impaired with only one hit, there is no selection pressure by the 
tumour for loss or mutation of the wild-type allele. 

A third possibility is that a gene or genes in the region of consistent 
deletion shows haploinsufficiency for tumour suppressor function and 
single-copy loss of the TSG is sufficient for aberrant TSG function and 
promotion of cancer (Fig. 1). In contrast with classical TSGs that are 
insensitive to large reductions in expression or activity, the function of 
haploinsufficient TSGs is impaired by a 50% partial reduction in 
expression or activity, or sometimes by subtler changes, as in a quasi- 
insufficiency of TSG function that we discuss below. 

The role of haploinsufficiency in cancer has been met with scepticism, 
despite the well-established role of haploinsufficiency in numerous 
developmental disorders such as, for instance, aniridia (caused by hap- 
loinsufficiency for PAX6) and Grieg’s syndrome (caused by haploinsuf- 
ficiency of GLI3)’’. Just as increased dosage of genes can result in 
developmental syndromes, such as Down’s syndrome, increased dosage 
or activity of oncogenes is an established genetic mechanism of malig- 
nancy*, exemplified by the role of MYC amplification or RAS protein 
hyperactivity in cancer. However, the notion that subtle decreases in 
gene dosage or protein activity can be relevant to cancer has gained only 
limited acceptance in the scientific community. 
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Figure 2 | Tissue specificity and context dependency of tumour 
suppression. The phenotypic outcome of a reduction in PTEN expression is 
differentially manifested depending on tissue type and genetic background. 
BM, bone marrow; HSC, haematopoietic stem cell; LA, lymphadenopathy; PD, 
poorly differentiated; SM, splenomegaly; WD, well-differentiated. The effect of 
complete loss of PTEN is highly context-dependent due to the obligate 
haploinsufficiency caused by PTEN loss-induced cellular senescence (PICS). 
Data summarized here come from multiple groups and studies of genetically 


engineered mice with differing PTEN alleles and expression** #707), 


One reason for this scepticism stems from the difficulty in definitively 
proving that a haploinsufficient TSG is involved in tumorigenesis. 
Whereas the rare TSGs that fully conform to the two-hit hypothesis 
can be identified by their homozygous deletion or mutation in cancer, 
haploinsufficient TSGs cannot be identified in such a manner. 
Moreover, large regions of the genome, encompassing many genes, 
can be targeted by allelic loss and likewise, many genes may acquire 
somatic mutations during the course of tumour development. It is 
assumed that only a portion of these genes is responsible for the cancer, 
while the rest are passenger mutations. What then could be the gold 
standard by which to determine which gene or genes in the region are 
causative and which are non-involved bystanders? Unlike the analysis of 
traditional TSGs, no one assay or approach offers definitive proof that a 
haploinsufficient TSG is causally involved in tumorigenesis. Instead, an 
integrated approach involving human genetic analysis, in vitro and ex 
vivo functional studies, and murine cancer genetic modelling is critical 
to ascertain whether a gene may function as a haploinsufficient gene in 
cancer. Although there are caveats of each analysis on its own, a body of 
evidence from each of these analyses can strongly implicate a gene in 
dosage-sensitive tumour suppression. 

Throughout this review, we use PTEN as a model dosage-sensitive 
TSG, but numerous other TSGs exhibit haploinsufficiency and dosage- 
sensitivity. Notably, p53 shows haploinsufficiency. Mice with heterozyg- 
ous p53 mutation (p53*/~) show an intermediate survival to that of p53 
homozygous mutants and wild-type mice, and tumours that develop in 
the p53°'~ animals do not always display loss of the remaining wild- 
type allele’. Similarly, tumours in patients with Li-Fraumeni syndrome, 
a cancer susceptibility syndrome caused by germ line mutation of p53, 
do not always exhibit loss of the wild-type p53 allele, suggesting that 
haploinsufficiency of p53 may be sufficient for tumour initiation in 
humans as well*®. A caveat with these analyses is that the effect of p53 
haploinsufficiency on cancer initiation in mouse models and Li- 
Fraumeni patients may be due to complex non-cell autonomous effects 
stemming from the partial loss of p53 throughout the body. However, ex 
vivo analysis of murine p53 heterozygous thymocytes showed that these 
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cells have an impaired apoptotic response to ionizing radiation or etopo- 
side*'. Similarly, a p53-deleted HCT116 isogenic cell line expressed only 
25% the normal level of p53 messenger RNA and showed impairment in 
induction of p53-responsive genes and apoptosis after exposure to ultra- 
violet radiation”. These studies demonstrate that reductions in p53 
dosage and function can have an impact on a cell’s ability to respond 
to oncogenic stimuli and strengthen the notion that p53 haploinsuffi- 
ciency may drive tumorigenesis. 

Similarly, other classical cancer susceptibility genes such as BRCA1 
and BRCA2 may show haploinsufficiency and/or dosage effects. Primary 
breast and ovarian cells from patients with heterozygous germ line 
mutation of BRCA1 or BRCA2 have altered mRNA profiles compared 
to BRCA wild-type cells, indicating that single hits in these genes can 
confer phenotypic differences that could, in principle, have an effect on 
breast and/or ovarian tumorigenesis**. Indeed, ex vivo studies of human 
BRCAI1 mutant breast cells have found that these cells show enhanced 
colony formation potential** and show impaired lineage commitment in 
differentiation assays*. Although tumour formation in BRCAI and 
BRCA2 carriers does appear to require the second hit, these one-hit 
premalignant changes could have an impact on tumorigenesis by either 
promoting loss of the second TSG allele or of other TSGs, or by altering 
the cellular phenotype to predispose to certain tumour subtypes. 

Another haploinsufficient TSG is the transcription factor PAX5. PAX5 
is found mutated, deleted or translocated in approximately 30% of acute 
lymphoblastic leukaemia cases, but the aberrant mutations/deletions are 
almost invariably mono-allelic and seem to function as hypomorphs, not 
dominant negative alleles**. Similarly, the E3 ligase gene FBW7 (also 
known as FBXW7) is mutated in approximately 10% of human colorectal 
cancers*’. In 70% of these cases, the mutations are mono-allelic, indi- 
cative of haploinsufficiency. Conditional mono-allelic deletion of Fbw7 
in the murine intestine cooperated with the tumorigenic APC” allele in 
intestinal tumorigenesis, albeit to a lesser degree than biallelic deletion of 
Fbw7, indicating that Fbw7 is dosage-sensitive’’. 

Like complete loss of TSGs, the effect of TSG haploinsufficiency can 
be highly tissue-specific and context-dependent (Fig. 2). Thus, the 
cellular and molecular context in which TSG function is altered will 
determine the outcome of such impairment of TSG function. The dif- 
fering outcomes of TSG expression level in differing situations may 
reflect distinct thresholds of protein expression or activity needed for 
certain processes or in differing cell contexts. For example, in one cell 
type there may be other compensatory proteins that mask the potential 
phenotype caused by TSG haploinsufficiency, whereas in other cells 
these proteins are not expressed and so the haploinsufficiency of the 
TSG manifests as tissue-specific cancer susceptibility. 


Two hits for tumour suppression 


The genetic background of the individual or tumour will also have an 
impact on the phenotypic outcome of TSG haploinsufficiency. In some 
cases, combinatorial effects could arise whereby haploinsufficiency or 
mutation of other genetic loci is required for a phenotype caused by 
TSG haploinsufficiency. A powerful example of this context dependency 
is provided by the interaction between Pten deficiency or haploinsuffi- 
ciency and the presence or absence of mutated p53 in mouse models 
(ref. 38; Fig. 2). In the context of wild-type p53, Pten haploinsufficiency 
is actually more tumorigenic in the prostate than complete loss of Pten, 
because the latter triggers a p53-dependent fail-safe senescence mechanism 
called Pten-loss-induced cellular senescence (PICS). Haploinsufficiency of 
Pten enhances proliferation without activating PICS, making partial loss of 
Pten more detrimental in this context than complete loss of Pten (Fig. 2). 
Similarly, complete loss of Pten in the murine haematopoietic compart- 
ment induces haematopoietic stem cell exhaustion or bone marrow failure, 
unless loss of Pten is accompanied by other genetic events”. After accu- 
mulation of aneuploidy or other mutations, such as loss of p53, complete 
loss of Pten in the blood induces fatal leukaemias**“° (Fig. 2). 

This phenomenon defines a novel paradigm of obligate haploinsuffi- 
ciency in which selection pressure during tumour growth may favour 
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haploinsufficiency of a TSG over complete loss in certain scenarios 
(Fig. 1). However, in advanced cancers with p53 mutation or loss, 
PICS cannot be induced and complete loss of Pten enhances prolifera- 
tion and tumorigenesis to a greater degree than Pten haploinsufficiency. 
This remarkable finding is likely to explain why complete loss of PTEN is 
typically restricted to advanced cancers and also emphasizes the import- 
ance of single-hit haploinsufficiency of TSGs in cancer initiation. 
Understanding the combinatorial and contextual dependencies of genetic 
events should not only enhance our ability to effectively treat tumours 
with differing combinations of genetic hits, but will also inform chemo- 
preventative strategies for eradicating cancer by attacking mechanisms 
necessary for its development. 

Like PTEN, many other TSGs have obligate haploinsufficiency. One 
example is the microRNA-processing enzyme DICERI. Mono-allelic 
deletion or mRNA downregulation of DICER1 is observed in diverse 
cancers*!“*, In murine models, mono-allelic conditional deletion of 
Dicer1 enhances lung tumorigenesis driven by oncogenic Kras"', sarcoma 
development induced by Kras©’?? and loss of p53 (ref. 41), and retino- 
blastoma formation induced by deletion of Rb1 and p107 (ref. 42). In the 
lung model, complete inactivation of Dicer1 improved survival compared 
to animals with mono-allelic Dicer1 deletion*'. Tumours that did develop 
in the Dicer!" animals retained one wild-type allele and partial 
Dicer1 expression, strongly suggesting that partial loss of Dicerl pro- 
motes tumorigenesis. 

A recent in vivo short hairpin RNA (shRNA)-based screening 
approach in mice identified lymphoma tumour suppressors, including 
one, the cell cycle regulator Rad17, which seemed to function as a TSG 
with obligate haploinsufficiency*’. The use of multiple hairpins targeting 
Rad17 led to the discovery that only hairpins that partially suppressed 
Rad17 promoted lymphomagenesis, whereas hairpins that completely 
repressed Rad17 expression were selected against in the lymphomas. 
This study highlights the utility of shRNA knockdown and screening 
technologies for the identification of haploinsufficient TSGs. 

NPM1, found mutated and translocated in haematopoietic malignancies, 
also shows obligate haploinsufficiency. Whereas single-allelic deletion of 
Npm1 induces tumour formation in mice, biallelic loss induces embryonic 
lethality and is also incompatible with cell growth after conditional deletion 
ex vivo’’. Although more data are needed the tumour-suppressive isoform 
of Trp63, TAp63, may also show obligate haploinsufficiency in mice; 
heterozygosity for TAp63 promoted tumorigenesis to a greater degree 
than complete loss of TAp63 in genetically engineered mice, and tumours 
that developed in the heterozygous mice did not display loss of the wild- 
type allele*®. Similarly, PinX1, recently identified as a haploinsufficient 
tumour suppressor’’, may also show obligate haploinsufficiency; hetero- 
zygous loss of PinX1 in a mouse model promoted cancer development in 
multiple organs, and tumours did not lose the wild-type allele or expres- 
sion of PinX1 (ref. 47). 

A genome-wide screening study in yeast indicated that about 3% of 
the genes in the yeast genome showed haploinsufficiency when assayed 
under specific growth conditions**. These haploinsufficient genes were 
enriched for ribosomal genes and genes that are highly expressed. 
Currently, mammalian tumour suppressors that show haploinsuffi- 
ciency do not seem to conform to one specific functional class. 
Besides p53 and PTEN, dozens of other TSGs show haploinsufficiency 
for tumour suppression*”°®, many of which we have discussed above. 
These TSGs come from many functional classes, including transcription 
factors (for example, PAX5 and TAp63), cell cycle checkpoint genes 
(RAD17), E3 ligases (FBW7), ribosomal or associated proteins 
(NPM1), and other diverse genes such as DICER1 and PINX1. 


A continuum model for tumour suppression 

Rather than TSG function and activity following discrete, step-wise 
changes, evidence is emerging that supports the dosage-dependency 
of TSG function. Data from an allelic series of targeted Pten alleles that 
results in varying levels of Pten in vivo in genetically engineered mice?!” 
demonstrate the tight correlation between Pten expression level and 
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function. Taking advantage of a null Pten allele in combination with a 
Pten ‘hy allele, which is expressed at a lower level than the wild-type allele, 
an allelic series was created in which Pten expression follows a decreasing 
pattern in the order Pten*!* > Pten™'* > Pten*'~ > Pten'-. 
Surprisingly, Pten™’* animals, with cells expressing 80% of the normal 
level of Pten, did display many of the phenotypes observed in Pten*/~ 
animals, such as lymphadenopathy, splenomegaly and mammary gland 
tumours”. Moreover, survival of Pten™”’* animals is compromised com- 
pared to Pten*'* controls, but not to the same degree as the survival 
impairment in Pten*’~ animals™. Thus, the tumour incidence and sur- 
vival of the animals correlate with the Pten expression level*’. Critically, 
mammary tumours from Pten™’* animals do not show loss of Pten 
expression or loss of the wild-type Pten allele and exhibit a Pten dose- 
dependent increase in activation of Akt, further demonstrating that a 
subtle reduction in Pten dose can result in deregulated signalling and 
cancer’. Similarly, in the murine prostate, a progressive lowering of 
Pten levels correlates with Akt activation, prostate enlargement, prolifera- 
tion of prostate cells and tumorigenesis’'. These in vivo studies demon- 
strate that even a subtle 20% reduction of protein level could constitute an 
important hit involved in the development of cancer. We termed this state: 
TSG ‘quasi-insufficiency’”” (Fig. 1). 

Thus, protein function can be a continuum that is related to the level 
of expression or activity of the TSG rather than to discrete step-by-step 
changes in gene copy number. We therefore propose a shift from the 
classical discrete model of tumour suppression to a continuum model 
(Fig. 3). This model takes into account the fact that subtle regulation of 
TSGs can have profound consequences on cancer susceptibility and/or 
progression, and in turn has important implications for cancer diagnosis 
and therapy. Note that whereas the function of PTEN seems to be 
strongly coupled to its expression level, for other TSGs it will be the 
activity of the protein and not the expression level that may be critical. 
This situation is readily observed in tumours with p53 mutation where 
the mutant allele is highly expressed but functionally inactive or acting 
as a dominant negative. 

Like the effect on TSGs, dosage can be extremely important for regu- 
lating the cellular effects of oncogene expression. Markedly different 
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Figure 3 | The continuum model of tumour suppression. The classical 
discrete, step-wise model of tumour suppression (left) is contrasted with a 
continuum model of tumour suppression and oncogenesis (centre and right, 
respectively). In the discrete model, tumorigenesis is induced by either 
complete loss ofa TSG (two-hit paradigm, dark blue) or after single-copy loss of 
a TSG (haploinsufficiency, light blue). In contrast, we propose a continuum 
model (centre and right), in which tumour suppression is related to a 
continuum of TSG expression, rather than to discrete changes in DNA copy 
number. A continuum of increasing TSG expression will generally be negatively 
correlated with malignancy (centre, light green), whereas increasing oncogene 
expression will generally be positively correlated with malignancy (right, red). 
A linear relationship is depicted for schematic purposes, but the dose-response 
relationship need not be linear. In some cases, fail-safe mechanisms are induced 
by complete loss of TSG expression or by massive oncogene overexpression. In 
these cases, complete loss of TSG expression (centre, dark green) or massive 
overexpression of an oncogene (right, orange) will be negatively correlated with 
malignancy, as shown. 
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consequences for the cell and for cancer development can occur depend- 
ing on the level of oncogene expression. For example, high expression of 
oncogenic, mutation of RAS-family genes drives senescence, not prolif- 
eration®. Similarly, robust expression of MYC induces apoptosis, not 
proliferation®’. These examples mirror the paradigm of PICS and obligate 
haploinsufficiency and suggest that tumours may select for optimal 
expression of oncogenes. Data to support this notion exist already in 
the rampant amplification of mutated oncogenes, such as the amplifica- 
tion of the mutated EGFR gene in human lung cancer****. Such data 
indicate that it is not only the mutation of an oncogene that is important 
for cancer, but also its expression level. In this respect, the continuum 
model could also apply to oncogenes (Fig. 3). 

It seems now that some TSGs are exquisitely sensitive to dose (for 
example, PTEN), some are intermediately sensitive (for example, p53) 
and others are more resistant to expression level and/or activity changes 
(for example, RB1). Those that are most resistant to changes in express- 
ion or activity, such as RB1, will be most likely to conform to the two-hit 
hypothesis, whereas those that are exquisitely sensitive to dose may 
exhibit frequent monoallelic deletion/mutation or be misregulated in 
cancer via other means, such as miRNA overexpression. Moreover, 
tumour suppressors that show obligate haploinsufficiency due to the 
lethality of homozygous inactivation of the TSG will never show com- 
plete inactivation in cancer unless there are other context- or genotype- 
dependent events that favour their complete inactivation (as in the case 
of Pten/p53 discussed above). 

The dogma that every TSG must behave according to the two-hit 
paradigm could significantly slow the pace of cancer research if poten- 
tially critical genes are dismissed as unimportant. The pace of cancer 
drug development will in turn be delayed by a lack of appreciation of the 
involvement of these potentially druggable pathways in tumour 
development and maintenance. A very real challenge for cancer biolo- 
gists in the next decade will be first to define the TSGs affected by 
haploinsufficiency and then to model these subtle dosage effects in a 
way that can yield quantitative models of cancer risk or outcome. 
Because human genetics analysis alone is often not enough to identify 
critical haploinsufficient genes unequivocally, mouse models will prove 
essential to define which genes show haploinsufficiency for TSG func- 
tion and in which specific cellular and genetic contexts. 


Regulation of TSG activity 

The continuum model of tumour suppression holds that precise regu- 
lation of TSG expression and activity is critical and thus, mechanisms of 
regulation of expression and function may powerfully contribute to 
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tumour suppression. One of the most widespread networks of expression 
regulation is the post-transcriptional regulatory network of miRNAs and 
competing endogenous RNAs (ceRNAs). In addition to these networks, 
TSG dosage/activity is regulated by transcriptional, translational and 
post-translational mechanisms (Fig. 4). These networks open up the 
realm of genes that could be contextually important for tumorigenesis, 
especially when considering mechanisms that tune TSG activity and not 
only those that function as on/off switches. 

As an example, a powerful PTEN-targeting miRNA network was 
recently identified. miRNAs are small non-coding RNA molecules that 
regulate protein expression through inhibition of protein translation or 
induction of direct cleavage of their mRNA targets via RISC*’. Many of 
the PTEN-targeting miRNAs are amplified or overexpressed in cancer 
such as miR-26 in glioma”* and miR-22 in prostate cancer*. The regu- 
lation of coding RNAs by miRNAs is made additionally complex 
through the interaction of miRNAs with ceRNAs®*'. We coined the 
term “competing endogenous RNA” (ceRNA) to refer to an RNA— 
coding or noncoding—that shares miRNA binding sites with another 
RNA with which it competes for miRNA binding®**’. Any two or more 
mRNAs with shared miRNA recognition elements (MRE) can act as 
ceRNAs towards each other by competing for miRNA binding and 
thereby co-regulating each other’s expression. As the expression of the 
ceRNA increases, the miRNA binding sites on the cRNA compete with 
the miRNA binding sites on the other transcript, and less miRNA is 
available for binding to the TSG transcript (Fig. 4). ceRNA expression 
(for example, expression of the PTEN pseudogene®) hence results in 
sustained TSG expression by preventing miRNA binding to the TSG and 
subsequent mRNA or protein downregulation. Conversely, loss of 
ceRNA expression could result in low TSG expression through diversion 
of additional miRNA molecules to the TSG transcript®*'. Even non- 
coding ceRNAs that are not expressed at the protein level could function 
as TSGs by regulating expression of protein-coding TSGs, so mutation 
or deletion of ceRNA genes could be important in cancer®*". 


Implications for cancer susceptibility and therapy 

An appreciation of the continuous nature of tumour suppressive function 
must in turn lead to a re-evaluation of the role of TSG dosage in cancer 
susceptibility, diagnosis and treatment. Subtle variation in TSG expres- 
sion or activity could underlie genetic variation in cancer susceptibility in 
the population. Inherent, constitutive differences in gene expression 
could be caused by polymorphic variation in TSG promoter regions or 
in miRNA binding sites or other regulatory regions. Importantly, 
synonymous single nucleotide polymorphisms (SNPs) that do not 
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Figure 4 | Mechanisms of regulation of TSG dosage. The interaction 

between coding and non-coding factors determines final TSG dosage. Classic 
mechanisms such as DNA copy number (1), transcriptional regulation (2) and 
epigenetic silencing (3) can affect expression of TSG mRNA. TSG mRNA level 
or translation into protein is then regulated by miRNAs (5). The availability of 
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miRNAs for TSG downregulation is further regulated by ceRNA-mediated 
sponging effects (4). Finally, additional translational regulation (6) or post- 
translational modifications contribute to the final protein expression, function 
and effective dosage. The protein structure shown is PTEN (Protein Data Bank 
accession code 1D5R). 


11 AUGUST 2011 | VOL 476 | NATURE | 167 


©2011 Macmillan Publishers Limited. All rights reserved 


REVIEW 


change the protein sequence of an expressed gene could still have a 
powerful impact by altering the level of TSG expression through altera- 
tion of miRNA binding. Moreover, SNPs in non-coding RNAs could 
have an impact on tumour development through ceRNA effects. 
Alternatively, on the basis of a continuum model, susceptibility to cancer 
could be induced by environmental factors through subtle transcriptional 
up- or downregulation of relevant genes. Development of drugs that 
prevent and counter these fluctuations could be used as chemopreventa- 
tive agents to reduce the impact of environmental carcinogens or to 
correct the effects of inherited predisposing factors. 

Just as transcriptional or post-transcriptional regulation may deter- 
mine cancer susceptibility, it could also be harnessed for cancer preven- 
tion and therapy (Fig. 5). As an example, statins, which are already in 
clinical use, were recently found to upregulate PTEN through PPARy 
signalling®. Statins or PTEN-enhancing drugs could therefore be used 
not only to treat dyslipidaemic cases but also to elevate PTEN expression 
in patients with tumours with lowered levels of PTEN (Fig. 5). Such an 
approach would require a quantitative analysis of the expression levels 
of TSGs and oncogenes and not solely a genetic assessment of their 
presence or absence. 

TSG-targeting microRNAs add another dimension to genotype/ 
phenotype considerations in targeted cancer prevention and therapy. 
Overexpression of one or more of TSG-targeting miRNAs could confer 
cancer susceptibility, and tumours not showing genetic alterations in the 
TSG but with aberrant TSG levels due to miRNA misregulation could 
behave similarly to tumours with genetic deletion or mutation of the 
gene. Mapping the interactions between miRNAs and TSGs could be 
useful for defining and predicting tumour response to therapy. 

Provocatively, tumours with partial loss of TSGs like PTEN that show 
obligate haploinsufficiency might be treated with agents to fully suppress 
the expression of the TSG, thereby suppressing tumour growth via 
senescence or other fail-safe mechanisms (Fig. 5). Alternatively, when a 
TSG is completely lost, activation of alternative tumour-suppressive 
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Figure 5 | Opportunities for therapeutic intervention after partial or 
complete loss of TSG expression. Tumour genotype ultimately determines 
which therapies can be considered for intervention. Broadly, two general 
approaches can be taken: restoration of the TSG pathway or function (upper 
panel) or induction of fail-safe mechanisms by complete downregulation of the 
TSG (lower panel). In tumours with downregulation of TSG expression in the 
absence of complete mutation or deletion, expression of the TSG can be 
attempted using drugs that induce the TSG transcription, inhibit miRNA- 
induced TSG downregulation, or relieve epigenetic silencing. Alternatively, fail- 
safe mechanisms may be induced by further downregulation of the TSG, such 
as inhibition of PTEN by the small molecule VO-OHpic®. In tumours with 
mutation or complete loss of the TSG, activation of other TSG pathways could 
be pursued. Alternatively, if a synthetic lethality relationship with the TSG is 
known, then the synthetic lethal gene can be targeted to induce cell death in the 
cells with complete loss of the TSG. 
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pathways or synthetic lethality approaches could be considered*’ 
(Fig. 5). The success of such strategies would obviously depend on the 
integrity of the fail-safe response and the expression level of the TSG, 
once again requiring a careful annotation of the molecular characteristics 
of the cancer lesion subjected to treatment. The need for the same careful 
annotation is not restricted to targeted therapies but applies to any cancer 
therapy because variations of oncogene or TSG dosage may well be at the 
core of failure of more conventional radio- or chemotherapies. 

On this basis, the scientific challenge will be to develop quantitative 
assays that can assess and score subtle differences in TSG gene expres- 
sion to allow accurate diagnostics and prediction of risk/prognosis. Such 
diagnostics will also be important for analysis and prediction of response 
to therapeutics, because subtle variations could also determine response 
to therapy. A tumour with a partial downregulation of a TSG could 
respond differently from a tumour that has either no alteration of the 
TSG or complete mutation/loss of the gene (Fig. 5). Critically, in cases 
where a tumour shows downregulation of a TSG via transcriptional 
repression, epigenetic downregulation, or aberrant miRNA and 
ceRNA regulation, there is an opportunity to explore therapeutic res- 
toration of TSG expression. 

The recent explosion of knowledge in the field of non-coding RNA 
biology has opened a new frontier in understanding gene dosage regu- 
lation and its involvement in tumorigenesis. In the months and years to 
come, careful experimentation is required to map out the networks of 
gene regulation of TSGs and oncogenes to allow drug development and 
creation of diagnostics from this information. Forty years after the two- 
hit hypothesis, we are refining and extending our understanding of TSG 
inactivation in cancer. Milestones made in these past forty years have 
contributed to the decline in cancer deaths observed in the US since 1990 
(ref. 64). The next decades promise further relief from the burden of 
cancer as we continue to tease out its molecular genetic basis. 
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African Americans 
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Recombination, together with mutation, gives rise to genetic variation in populations. Here we leverage the recent 
mixture of people of African and European ancestry in the Americas to build a genetic map measuring the probability of 
crossing over at each position in the genome, based on about 2.1 million crossovers in 30,000 unrelated African 
Americans. At intervals of more than three megabases it is nearly identical to a map built in Europeans. At finer scales 
it differs significantly, and we identify about 2,500 recombination hotspots that are active in people of West African 
ancestry but nearly inactive in Europeans. The probability of a crossover at these hotspots is almost fully controlled by the 
alleles an individual carries at PRDM9 (P value < 10~**°). We identify a17-base-pair DNA sequence motif that is enriched 
in these hotspots, and is an excellent match to the predicted binding target of PRDM9 alleles common in West Africans 
and rare in Europeans. Sites of this motif are predicted to be risk loci for disease-causing genomic rearrangements in 
individuals carrying these alleles. More generally, this map provides a resource for research in human genetic variation 


and evolution. 


In humans and many other species, recombination is not evenly 
distributed across the genome, but instead occurs in ‘hotspots’: 
2-kilobase (kb) segments where the crossover rate is far higher than 
in the flanking DNA sequence’. The highest-resolution genetic map 
in contemporary humans so far—the deCODE map—is based on 
about 500,000 crossovers identified in 15,000 Icelandic meioses*. 
However, a limitation of maps built in people of European descent*® 
is that they may not apply equally well in other populations, as sug- 
gested by comparisons of maps across ethnic groups*’ ° and patterns 
of linkage disequilibrium breakdown, which indicate that more of the 
genome may be recombinationally active in West Africans’®. It is 
known that a major determinant of the positions of recombination 
hotspots is PRDM9, a meiosis-specific histone H3 methyltransferase 
whose zinc finger (ZF) domain binds DNA sequence motifs'’*. In 
Europeans, PRDM9 ZF arrays are predominantly of two similar types, 
A and B, both of which bind the 13-bp motif CCNCCNTNNCCNC"'. 
In contrast, 36% of West African alleles are not of the A or B type”””’. 
Sperm typing of males who carry neither the A nor the B allele has 
shown no evidence of crossover activity at recombination hotspots 
associated with the 13-bp motif”. 


Building an African-American genetic map 


To investigate differences in the crossover landscape across human 
populations, we built a genetic map in African Americans, who have 
an average of about 80% West African and 20% European ancestry, 
leading to genomes comprised of multi-megabase stretches of either 
West African or European ancestry'*. Computational approaches, 
including HAPMIX”°, have been developed to infer the probability of 
0, 1 or 2 European or African alleles at each locus in individuals geno- 
typed at hundreds of thousands of single nucleotide polymorphisms 
(SNPs)'*!”. Positions where the inferred number of European or 
African alleles changes reflect crossover events that have occurred since 
admixture began (on average six generations ago'’). Change in the 
probability of European ancestry between adjacent SNPs can be inter- 
preted as the probability of such a crossover between them. We inferred 
crossover events in 29,589 apparently unrelated African Americans 
who had been genotyped on SNP arrays in genetic association studies 
(Methods; Fig. 1a). To minimize false-positive crossovers, we restricted 
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to crossovers that HAPMIX inferred with a probability of >95%, and 
that were flanked by a minimum of 2-centimorgan (cM) stretches 
where the ancestry was inferred to be unchanging (Supplementary 
Note 1). This produced 2,113,293 high-confidence crossovers, with a 
typical switch point resolved within 70kb with probability 50% 
(Supplementary Note 1). 

To build a high-resolution African-American genetic map (AA map), 
we leveraged the fact that most crossovers occur in hotspots shared 
across individuals* (Methods). Intuitively, although any crossover can 
only be roughly localized, inter-SNP intervals that are inferred to have 
an appreciable probability of crossover in multiple individuals are likely 
to contain recombination hotspots, allowing much better localization 
(Supplementary Fig. 1). To implement this idea, we modelled the 
recombination rate for each inter-SNP interval as shared across indivi- 
duals and used Markov chain Monte Carlo (MCMC) to sample rates 
consistent with the data (Methods). This provides well-calibrated 
estimates of the crossing-over rate between all pairs of markers as well 
as estimates of rate uncertainty (Supplementary Note 1 and Sup- 
plementary Fig. 2). We find that the interval size at which the average 
recombination rate is equal to the standard error is 6 kb, which is the 
same accuracy that would be expected from a map based on 500,000 
crossovers whose boundaries were precisely resolved (Supplementary 
Note 1). Despite this high resolution, there are also some limitations. 
First, the AA map does not separately infer male and female recom- 
bination rates (it is a sex-averaged map) and requires normalization by 
the total map length (like linkage disequilibrium maps*”*). Second, the 
map has less resolution and may miss a higher fraction of true cross- 
overs at loci where it is more difficult to detect and resolve crossovers 
owing to low SNP density or low differentiation between West 
Africans and Europeans. Third, the map may be biased where ancestry 
deviates from the average, for example at chromosome 8q24, where 
the 10% of the people in this study who have prostate cancer have an 
increased proportion of African ancestry’. Fourth, the map assumes 
that all individuals are unrelated, whereas in fact there is probably 
some shared ancestry, resulting in multiple counting of some cross- 
overs and an overestimation of map precision. 

To assess the accuracy of the AA map, we generated an independ- 
ent African-American pedigree map by analysing 222 nuclear families 
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Figure 1 | Building an African-American genetic map. a, HAPMIX detection 
of crossovers between segments of inferred ancestry is illustrated in a father- 
mother-child trio. Black segments show inferred crossovers; arrows show 
transmission of ancestral crossovers from parent to child; purple/green segments 
show de novo events (paternal/maternal origin, respectively) corresponding to 


that included 1,056 meioses in which we could directly detect cross- 
overs between parent and child (Methods; Fig. 1a). Examination of 
the AA map rate around directly detected crossovers confirms the 
high resolution: the rate around such crossovers shows at least as 
strong a peak as that observed in maps based on linkage disequilib- 
rium**"* (Supplementary Fig. 3). We next computed correlation co- 
efficients for both the AA map and the deCODE map* to maps derived 
from the breakdown of linkage disequilibrium in Europeans (CEU) 
and West Africans (YRI)'*. At broad scales (>3 Mb) they are almost 
identical (9 > 0.97; Table 1). At fine scales, the AA map is more 
accurate (Table 1 and Supplementary Table 1), as reflected in a modest 
improvement in correlation to the CEU map at a 3-kb scale 
(Paa.ceu = 0.66 versus PgecopE,cEU = 9.58), and a major improve- 
ment for the YRI map, also at a 3-kb scale (94a ypi = 0.71 versus 
PaeCovE,yrI = 9.53). The deCODE map is more correlated to the 
CEU map than to the YRI map at scales <1 Mb, suggesting that this 
map, built in Icelanders, reflects more European recombination rates. 
The AA map shows the opposite pattern, suggesting that it reflects 
more West African recombination patterns. 


Table 1 | Genetic map assessments at different size scales 
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events identified directly using two additional children (bottom, ‘pedigree 
inferred’). b, The AA map localizes five hotspots in a region of the MHC whose 
positions (blue) were previously mapped by sperm typing’. c, Comparison of 
maps shows a hotspot at 33.1 Mb in the African-derived AA and YRI maps, but 
not the deCODE and CEU maps (all maps smoothed to 10 kb). 


Population differences in hotspot locations 


We compared the rate estimates for all four maps (AA, deCODE, CEU 
and YRI) over a 200-kb region within the major histocompatibility 
complex (MHC) locus where recombination rates in European males 
have been characterized through sperm typing' (Fig. 1b). The AA map 
detects five of six known hotspots, and localizes them to within 1 kb (the 
sixth hotspot is weak, with a peak male rate below the genome average’). 
Notably, the two maps based on samples with African ancestry (AA and 
YRI) found a hotspot not present in either map based on samples of 
European ancestry (deCODE and CEU) (Fig. 1c; Supplementary Fig. 4 
gives a second example). We confirmed that such ‘African-enriched’ 
hotspots also occur genome-wide, by examining 2,375 loci with recom- 
bination rate peaks in the YRI map (>5cM Mb“ ') but not the CEU 
map (<lcM Mb’), and finding a rate rise in the independently 
generated AA map, but not in the deCODE map (Supplementary 
Fig. 5A). In the reciprocal experiment searching for European-specific 
hotspots, we find no such evidence for genuine ancestry specificity; at 
loci with recombination rate peaks in the CEU map but not the YRI 
map, there are weak peaks in both the deCODE and AA maps 


Scale (interval size) Pearson correlation (p) of the AA map (deCODE map) to the Estimated correlation of AAmap to Estimated coefficient of variation of AA map (s.e. 
specified LD map the true map (inferred by MCMC)+ divided by crossover rate expected for interval size)+ 
Combined LD* CEU YRI 

3kb 0.75 (0.63) 0.66 (0.58) 0.71 (0.53) 0.93 1.41 

10kb 0.82 (0.74) 0.73 (0.70) 0.78 (0.65) 0.96 0.73 

30 kb 0.86 (0.83) 0.78 (0.78) 0.83 (0.74) 0.98 0.36 

100 kb 0.91 (0.89) 0.84 (0.85) 0.87 (0.81) 0.99 0.17 

300 kb 0.94 (0.93) 0.89 (0.90) 0.92 (0.88) 1.00 0.08 

1 Mb 0.97 (0.96) 0.94 (0.94) 0.95 (0.95) 1.00 0.04 

3 Mb 0.98 (0.98) 0.97 (0.97) 0.98 (0.97) 1.00 0.02 


The numbers in this table are restricted to the autosomes and genomic segments more than 5 Mb from the telomeres. LD, linkage disequilibrium; s.e., standard error. 


*The combined map is the HapMap2 population-averaged linkage-disequilibrium-based map?®. 


+The s.e. of the map at each size scale is determined by the posterior probability distribution from the MCMC. 
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(Methods and Supplementary Fig. 5B). Thus, hotspots active in 
Europeans are consistently ‘shared’ with YRI and African Americans, 
whereas populations with African ancestry harbour additional, non- 
shared hotspots that we call “African-enriched’. 


Mapping variants underlying population differences 

To understand the features of recombination in West Africans that 
differ from Europeans, we estimated the degree to which each 
African-American person’s crossovers occur in African-enriched hot- 
spots, compared with shared hotpots, a phenotype we refer to as their 
African enrichment (AE). We view each individual’s crossovers as 
sampled from a mixture of two genetic maps—an ‘S map’ of shared 
hotspots based on the deCODE map, and an ‘AE map’ of African- 
enriched hotspots that is learned from comparing the deCODE and 
AA maps—so that the proportion of crossovers assigned to the AE 
map is a person’s AE phenotype (Supplementary Note 4). We tested 
approximately 3 million SNPs (genotyped and imputed) for asso- 
ciation with three phenotypes: AE, usage of linkage-disequilibrium- 
based hotspots known to be enriched for the 13-bp motif 


250 4 


200 4 


-log10 P value 
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CCNCCNTNNCCNC” and genome-wide crossover rate (in pedigrees) 
(Methods and Supplementary Note 4). In crossovers detected in un- 
related African Americans, the alleles a person carries are only 
sometimes descended from the ancestor in whom the crossover 
occurred, thus adding noise to the association signal (nevertheless 
there is useful signal given the large sample size; Supplementary 
Note 4). In the pedigree map, association between alleles and AE can 
be tested directly because we have genotypes in the parents. 

The SNP showing the strongest association with AE is rs6889665 
(P=1.5 X10 ~**; Fig. 2a and Supplementary Fig. 6), which has a 
derived allele frequency of 29% in YRI and 2% in CEU, and is within 
4kb of the ZF array of PRDM9 (refs 4, 9, 11-13). This SNP is asso- 
ciated with AE in both the pedigree individuals and the unrelated 
individuals (Supplementary Note 4), and is also the SNP most 
strongly associated with usage of linkage-disequilibrium-based hot- 
spots (P= 1.8 X 10°) (Supplementary Table 2). No locus outside 
PRDM9 is significant (P< 0.01 after Bonferroni correction; Sup- 
plementary Table 2). To understand better the association at 
1s6889665, we inferred the alleles in the PRDM9 ZF array carried 
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Figure 2 | Association of PRDM9 genetic variation with hotspot activity. 
a, A genome-wide association study measuring association of the AE 
phenotype shows a single genome-wide significant peak at PRDM9, with 
1s6889665 the best-associated SNP. b, Relationship between alleles of 
rs6889665 and predicted binding target of the PRDM9 ZF array’ for West 
African and European samples. The binding predictions are grouped into 8 
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clusters according to their best-matching region to the 13-bp motif, and 
annotated by the number of bases matching the motif. The African-enriched 
186889665 C allele always co-occurs with motifs with a poor (5/8) match to the 
13-bp motif. c, Gene tree” of the linkage disequilibrium block containing the 
PRDM9 ZF array (Methods); numbered circles show SNPs and significant P 
values for association, after conditioning on 1s6889665. 
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by 139 individuals based on sequencing data from the 1000 Genomes 
Project’®, using the reads to infer each individual’s PRDM9 alleles 
among 29 alleles whose full sequences were previously determined? 
(Supplementary Note 5). Grouping PRDM9 alleles on the basis of how 
closely their binding target predictions match the 8 non-degenerate 
bases of the 13-bp motif, following a previously described approach’, 
we find that the ancestral “I” variant at rs6889665 is strongly corre- 
lated to alleles with an exact (8/8) match to the 13-bp motif (including 
the A and B alleles), whereas the derived ‘C variant is almost perfectly 
correlated to a group of alleles, all predicted to bind a common, 
different 17-bp motif—CCgCNgtNNNCgtNNCC’—which matches 
the 13-bp motif at only 5 bases (5/8 match; less strongly signalled 
bases in the motif are in lowercase and ‘N’ may be any base). This 
implies a common historical origin for alleles matching this 17-bp 
motif (Fig. 2b, Supplementary Fig. 7 and Supplementary Note 5). We 
also experimentally measured the number of ZF domains in PRDM9 
in 354 individuals including 166 African Americans from the pedigree 
study (Methods). This showed, again, that rs6889665 differentiates 
PRDM9 alleles into two different classes, with 96% of haplotypes 
carrying the ancestral allele having <14 ZFs, and 93% of haplotypes 
carrying the derived allele having =14 ZFs (Supplementary Fig. 7). 
After conditioning on 1s6889665, there is no evidence that ZF array 
length is associated with the AE phenotype. Several SNPs near the 
PRDM9 ZF array show a conditional association signal that is much 
weaker than rs6889665, but still significant (Fig. 2c, Supplementary 
Fig. 6 and Supplementary Note 4), with the strongest at rs10043097 
(P=8.3 X10 '), upstream of the PRDM9 transcription start site. 
These SNPs may tag additional variation in the PRDM9 ZF array, or 
potentially expression levels. 


Finding a motif for African-enriched hotspots 


To identify directly candidate African-enriched hotspot motifs, we 
selected 2,454 loci with a high crossover rate in the AE map and 
YRI map (>2 cM Mb * over 2kb), and no more than half this rate 
in the S map and CEU map (this set is more powerfully enriched for 
higher recombination in people of African ancestry than the 2,375 
above, as it includes information from the contemporary maps). We 
compared these to a ‘control set’ of 7,328 candidate hotspots more 
active in the European- than the African-derived maps (Methods and 
Supplementary Note 6). To identify sequence motifs associated with 
the African-enriched hotspots*”, we identified short motifs that 
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Figure 3 | A sequence motif specifying the positions of African-enriched 
hotspots. a, Logo plot showing a degenerate 17-bp hotspot motif, with stack 
height proportional to —log P value, and relative letter height proportional to 
the mean crossover rate increase given each base. Below is the bioinformatic 
PRDM59 binding prediction for the alleles associated with 1s6889665 allele C 
(from Fig. 2b), matching this motif at 10/11 bases (lines). b, Average crossover 
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occurred at increased frequency in the African-enriched hotspot set 
(Supplementary Note 6). Testing all motifs with lengths of 5-9 bases 
revealed a 9-nucleotide motif CCCCAGTGA (odds ratio (OR) = 1.79, 
P=2.24 X10 °, Bonferroni corrected P = 0.004), which exhibited a 
kilobase-scale rate peak near occurrences of this motif in African- 
derived maps, but in neither of the European-derived maps (Sup- 
plementary Fig. 8). Further analysis revealed a strong influence of 
downstream flanking bases (Supplementary Fig. 9) and degeneracy, 
yielding a 17-bp consensus sequence, CCCCaGTGAGCGTtgCc 
(Fig. 3a; more strongly signalled bases are in uppercase), with the 
same consensus obtained when we considered flanking sequences 
for only odd or even chromosomes, and whether we based the analysis 
on AE-S or YRI-CEU map comparisons (Supplementary Note 6). 
The 500 best matches to this motif have a ~3-fold increase in average 
rate in the AA and YRI relative to the deCODE and CEU maps (Fig. 3b 
and Supplementary Fig. 8G). Hotspots associated with the motif occur 
in both unique and repetitive DNA (for example, L1PA10/13 LINE 
elements; Supplementary Fig. 10 and Supplementary Note 6). We also 
compared the 17-bp consensus to the binding motif predicted for 5/8 
match alleles, and found that they match almost precisely (Fig. 3a; 10 
of 11 bases, P= 8.1 X 10°). 


Assessing the impact of PRDM9 on recombination 


How much of the African-enriched recombination pattern can be 
explained by PRDM9? We estimated the fraction of variation in the 
AE phenotype explained by rs6889665 in our pedigree data after 
accounting for noise in the phenotype estimation (Supplementary 
Note 4). Over 82% of map usage variability is explained by the 
186889665 genotype alone. Given that there are further influential 
PRDMS9 variants (Fig. 2c), this gene may thus explain almost all dif- 
ferences in local rate between the West African and European popu- 
lations. We next examined rates around 82 narrowly defined 
(<10 kb) crossover sites in 7 individuals homozygous for the derived 
allele at rs6889665. There is no evidence of hotspots at these loci in 
either the deCODE or CEU maps (Fig. 3c), in contrast to crossovers in 
individuals carrying the ancestral allele at rs6889665 (Supplementary 
Fig. 11). Thus, crossover positions in individuals who are homozygous 
for the derived allele at rs6889665 are consistent with an entirely dif- 
ferent recombination hotspot landscape, which would imply PRDM9 
control of all hotspots’. Despite the strong correlation between maps at 
megabase scales, there is mounting evidence that PRDM9s influence 


Rates around crossovers in CC individuals 
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rate (in 2-kb sliding windows) in the AA (red line) and deCODE (black line) 
maps surrounding the 500 strongest motif matches. c, In seven rs6889665 CC 
individuals from the pedigree study, we localized 82 crossovers to within 10 kb, 
and plot average AA, YRI, deCODE and CEU map rates. There is no strong 
peak above local background in the deCODE or CEU maps. 
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on crossing over may not be limited to fine scales*'': we observe a 
weakly significant association of rs6889665 with the total number of 
crossovers genome-wide in pedigrees (P = 0.04), corresponding to an 
average 1.3 crossovers more per meiosis per derived allele, exceeding 
the strongest previously known association” at RNF212. 


Conclusions 


We have shown that PRDM9 alleles that bind a novel 17-bp motif and 
occur at greatly increased frequency in people of West African ancestry 
have led to a shift in the recombination landscape compared with 
people of non-African ancestry. The larger number of hotspots avail- 
able to West Africans implies that at the population level, crossovers 
are more evenly distributed than in Europeans", and thus the shorter 
extent of West African linkage disequilibrium is not due to differences 
in demographic history alone (such as the lack of an out-of-Africa 
founder event). Our findings also have medical implications, as 
recombination errors leading to insertions or deletions are known to 
be associated with recombination hotspots’”’**. Our results predict 
that the congenital abnormalities that have been associated with the 
recombination hotpots bound by PRDM9 A and Balleles will occur ata 
decreased rate in people of West African ancestry, whereas new dis- 
eases will arise due to recombination errors near African-enriched 
hotspots. 


METHODS SUMMARY 


We assembled SNP array data from 29,589 unrelated people and 222 nuclear 
families genotyped at 490,000-910,000 SNPs from the Candidate Gene Association 
Resource (CARe), studies at the Children’s Hospital of Philadelphia (CHOP), the 
African American Breast Cancer Consortium, the African American Prostate 
Cancer Consortium and the African American Lung Cancer Consortium. To build 
a recombination map, we used HAPMIX to localize candidate crossover positions’®, 
and implemented a MCMC that used the probability distributions for the positions 
of the filtered crossovers to infer recombination rates for each of 1.3 million inter- 
SNP intervals. We also implemented a second MCMC that models each individual’s 
set of crossovers as a mixture of an S map, similar to the European deCODE map, 
and an AE map, and then assigned each individual an ‘AE phenotype’ correspond- 
ing to the proportion of their newly detected crossovers assigned to the AE map. We 
imputed genotypes at up to three million HapMap2 SNPs"* and then tested each of 
these SNPs for association with the AE phenotype and other recombination- 
related phenotypes. We identified 2,454 candidate A frican-enriched hotspots with 
increased recombination rates in the YRI versus CEU maps, and in the AE versus S 
maps, and searched for motifs enriched at these loci, thus identifying a degenerate 
17-bp motif. To study the structure of PRDM9, we measured the length of the 
PRDM9 ZF array and genotyped rs6889665 in YRI, CEU and the CARe nuclear 
families; we also carried out imputation based on 1000 Genomes Project short read 
data" to infer the alleles individuals carry, among 29 previously characterized in a 
sequencing study of PRDM9 (ref. 9). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Samples used for building the AA map. The 29,589 unrelated African-American 
samples derive from five sources. Informed consent was provided by all the 
individuals participating in the study, and was approved by all of the institutions 
responsible for sample collection. 

The first source is the Candidate Gene Association Resource (CARe) study, a 
consortium of cohorts. We analysed CARe samples genotyped on the Affymetrix 
6.0 array from the Atherosclerosis Risk in Communities study (ARIC), the 
Cleveland Family Study (CFS), the Coronary Artery Risk Development in 
Young Adults study (CARDIA), the Jackson Heart Study (JHS) and the Multi- 
Ethnic Study of Atherosclerosis (MESA). After removing individuals known to be 
related, and restricting to SNPs with good completeness in all cohorts, we had 
data from 6,209 individuals typed at 580,000 SNPs. 

The second source consists of diverse studies carried out at the Children’s 
Hospital of Philadelphia (CHOP), which has established a biobank for 
Philadelphia children to facilitate large genotype-phenotype association analysis. 
The cohort was recruited by CHOP clinicians, nursing and medical assistant staff 
within the CHOP Health Care Network, including primary care clinics and out- 
patient practices, from the hospital’s patient base of over one million paediatric 
patients. All samples analysed here were genotyped on either the Illumina 610- 
Quad or Illumina HumanHap550 array. After removing individuals known to be 
related, identifying American Americans by multidimensional scaling on geno- 
type data, and restricting to SNPs with a high level of completeness across sam- 
ples, we had data from 7,503 samples typed at 491,572 SNPs. 

The third source is the African American Breast Cancer Consortium 
(AABCC), consisting of the Multiethnic Cohort study (MEC), the Los Angeles 
component of the Women’s Contraceptive and Reproductive Experiences study 
(CARE), the Women’s Circle of Health Study (WCHS), the San Francisco Bay 
Area Breast Cancer study (SFBC), the Carolina Breast Cancer Study (CBCS), the 
Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial Cohort (PLCO), 
the Nashville Breast Health Study (NBHS) and the Wake Forest University Breast 
Cancer Study (WFBC), all genotyped on an Illumina 1M array. After data cura- 
tion, including removal of samples with genetic evidence of being second-degree 
relatives or closer using the smartrel package of EIGENSOFT”® (>0.2 correlation 
of genotype state), we had data from 5,203 women (about half cases and half 
controls) typed at 894,717 SNPs. 

The fourth source is the African American Prostate Cancer Consortium 
(AAPCC), consisting of the MEC, the Southern Community Cohort Study 
(SCCS), PLCO, the Cancer Prevention Study II Nutrition Cohort (CPS-ID), the 
Prostate Cancer Case-Control Studies at MD Anderson (MDA), the Identifying 
Prostate Cancer Genes study (IPCG), the Los Angeles Study of Aggressive Prostate 
Cancer (LAAPC), the Prostate Cancer Genetics Study (CaP Genes), the Case- 
Control Study of Prostate Cancer among African Americans in Washington DC 
(DCPC), the Gene-Environment Interaction in Prostate Cancer Study (GECAP) 
and the Cancer Prevention Study II (CPS-II), all typed on an Illumina 1M array. 
After the same data curation as the breast cancer study, we had data from 6,540 
men (about half cases and half controls) typed at 896,036 SNPs. 

The fifth source is individuals from the African American Lung Cancer 
Consortium (AALCC), including cases and controls from the MEC, the SCCS, 
PLCO, the MD Anderson (MDA) African American Lung Cancer Study, the 
NCI-Maryland Lung Cancer Case-Control Study, the University of California 
at San Francisco African American Lung Cancer Study and the Wayne State 
African American Lung Cancer Study, all genotyped on the Illumina 1M array. 
After data curation, we had data from 4,134 individuals typed at 906,687 SNPs. 
Samples used for building the pedigree map. The pedigree map was built using 
data from 135 African-American nuclear families from CARe and 87 African- 
American families from CHOP for which genotyping data were available from at 
least two full siblings and at least one parent. The CARe studies that contributed 
samples were JHS (70 families, including 58 samples that we newly genotyped on 
the Affymetrix 6.0 array to increase the number of crossovers we could analyse) 
and CFS (65 families). For the families with a missing parent, we developed a 
Hidden Markov Model (HMM) approach to jointly estimate the genotype of the 
missing parent as well as to infer the position of crossover events in the offspring. 
The observed variables in the HMM were the genotypes of the available family 
members and the states of the HMM were the genotypes of the parents and the 
identity by descent (IBD) status of the children. A change in IBD status in an 
offspring is interpreted as a crossover event. Supplementary Note 2 provides 
details of the HMM used to infer positions of these pedigree crossover events. 
Local ancestry inference and identification of crossover events. We merged the 
data for each cohort with phased YRI and CEU data from the HapMap3 data set”’. 
We filtered SNPs that had a frequency inconsistent with an 80-20% linear com- 
bination of YRI and CEU frequencies (t statistic with an absolute value of greater 


than 3), potentially reflecting genotyping error in either the HapMap3 or the 
cohort data. 

We ran HAPMIX on these data using a prior hypothesis of 20% European 
ancestry and 6 generations since mixture for each individual'*. HAPMIX requires 
users to input a recombination map as a prior distribution, and we assumed that 
rates were constant across each chromosome arm with a total rate across each arm 
determined by the Rutgers genetic map® (Supplementary Note 1). 

Filtering of crossover events had three stages. First, we removed crossover 
events where the probability of occurrence was estimated to be less than 95% 
by HAPMIX. Second, we removed candidate crossover events that were non- 
monotonic, that is, where the probability of an overlapping crossover event with 
an ancestry switch in a different direction was =1% within any inter-SNP inter- 
val. Third, we removed crossover events where either of the two flanking ancestry 
blocks was smaller than 2 cM in size as measured with respect to a published map 
based on linkage disequilibrium*'* (Supplementary Note 1). For comparisons to 
the deCODE map and linkage-disequilibrium-based maps, we also removed 
segments of the genome within 5 Mb of the telomeres (to be consistent with 
the comparisons presented in the deCODE study where the same restriction 
was applied’). 

Construction of the AA map. All 22 autosomes and chromosome X were split 
into approximately 1.3 million inter-SNP intervals based on the union of SNPs 
analysed across all five sample sets. Our goal was to estimate a crossover rate for 
each of these intervals. We modelled crossover rates such that the rate for each 
SNP interval is independent of every other SNP interval, motivated by a hotspot 
model. We used a gamma prior on rates with the mean estimated from the filtered 
HAPMIX output (Supplementary Note 1). We used a Gibbs sampler to sample 
rates in every SNP interval and to determine the location of a crossover event 
within the 95% range estimated by the HAPMIX output. In each round of the 
Gibbs sampler, we used the set of sampled rates in the previous round to construct 
a probability mass function for the SNP interval in which each crossover 
occurred, using an approach described in Supplementary Note 1 to approximate 
the probability mass function that HAPMIX would have produced conditional on 
the previous set of sampled rates. After sampling the location of the crossover 
events, we counted how many crossovers occurred in every SNP interval. We used 
these counts to construct a posterior distribution for the crossover rate in each 
SNP interval, taking advantage of the conjugacy of a Poisson likelihood and a 
gamma prior. We then sampled a crossover rate for each SNP interval from its 
respective gamma posterior distribution. 

Candidate African-enriched hotspots. To identify candidate African-enriched 
hotspots, we used two pairs of maps: the previously available YRI map and CEU 
map, and the AE map and the S map. We combined information from both map 
pairs to enrich for regions with genuine differences between the West African and 
European populations. Specifically, we identified candidate hotspots as 2-kb 
intervals representing a peak in the AE map rate, where the estimated rate in 
the AE map was >2cM Mb ' and at least double that in the § map, and in 
addition the YRI map rate was >2cM Mb ' and at least double the CEU map 
rate. We took the resulting candidate hotspot set and defined hotspot boundaries 
by identifying the region flanking the 2 kb rate peak that had rates at least 50% of 
the peak value in the AE map. Regions larger than 5 kb were discarded. We 
similarly constructed a set of ‘shared’ hotspots but modified the initial criteria 
given the lack of obvious hotspots present only in people of European ancestry. 
Specifically, we identified 2 kb S map rate peak locations where both the S and 
CEU estimated rates were >2 cM Mb |, while the AE and YRI map rates were 
below those in these respective European populations. We then narrowed the 
regions and filtered using the same procedure we had developed for the candidate 
African-enriched hotspots. 

Association testing. MaCH” was used to impute up to 3,058,149 SNP genotypes 
from HapMap? (ref. 18) into all African Americans we analysed, using the un- 
related YRI and CEU samples as combined reference panels. We tested for asso- 
ciation at all SNPs with minor allele frequency > 1%. To restrict our analysis to 
individuals in whom the phenotype was measured accurately, we performed the 
association analysis with the AE and hotspot usage phenotypes only in indivi- 
duals with at least 35 inferred crossovers. Association testing was carried out 
using linear regression, after controlling for gender, genome-wide European 
ancestry proportion (inferred by HAPMIX) and study (Supplementary Note 
4). We observe slight inflation of the association statistics genome-wide com- 
pared with the expectation (the Genomic Control inflation factor” is 1.046 for the 
AE phenotype and 1.038 for the hotspot usage phenotype), which we propose 
may reflect cryptic relatedness among samples (Supplementary Note 4). We 
report P values after correction using Genomic Control”. 

Construction of PRDM9 tree. To examine the history of the PRDM9 ZF array 
and to place SNPs showing association with AE map usage within the framework 
of this history, we identified 19 SNPs from HapMap? (ref. 18) that surrounded the 
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ZF array and that form a maximal block of SNPs where there is almost no 
evidence of recombination: |D'| = 1 for all pairs of SNPs in the data after removing 
2 of 120 YRI and 1 of 120 CEU haplotypes (the chimpanzee genome was used to 
define the ancestral alleles). A unique ‘gene tree’ was then built, and we used 
genetree’, which assumes a coalescent prior on genealogies, to approximately infer 
ages for these mutations conditional on the data (a caveat is that the tree building 
does not account for the HapMap SNP ascertainment scheme). Because genetree 
assumes a randomly mating population, and the YRI represent almost all the 
HapMap haplotype diversity in this region, we ran the software (2,000,000 import- 
ance samples, otherwise default parameters) on the YRI data only and used this to 
construct Fig. 2c. Each node of the tree corresponds to a unique haplotype at these 
19 SNPs, whose frequency in both CEU and YRI is shown at the base of the figure. 
Motif searching. We tested all candidate motifs of 5 to 9 base pairs for enrich- 
ment in our African-enriched hotspot set relative to our shared hotspot set. We 
counted occurrences of all tested motifs in repeat and non-repeat backgrounds 
separately, and computed a separate P value for each genomic background with a 
chi-squared test, based on a contingency table that compares the counts of a 
particular motif to the counts of all motifs of that size. We converted each P value 
to a Z score, added the scores on each background, and then obtained a corres- 
ponding combined P value. Motifs were considered statistically significant only if 
they passed four stringent criteria: (1) they were statistically significant after 
Bonferroni correction for the number of motifs tested; (2) they were overrepre- 
sented in the African-enriched set; (3) they were statistically significant on both 
the repeat and non-repeat backgrounds (P < 0.01) independently; and (4) they 
were statistically significant when the joint P value was calculated only by com- 
paring the frequency of the motif to other motifs of identical G/C content (to 
eliminate false positives due to any difference in G/C content between the hotspot 
sets). This testing revealed a unique significant motif, the 9-nucleotide oligomer 
CCCCAGTGA. We explored whether flanking DNA around exact matches to 
this motif also had a role by testing whether bases at a given site relative to the 
motif were associated with the difference in rates between African- and 
European-ancestry populations (Kruskal-Wallis test). Rates were evaluated in 
the 2 kb surrounding each motif occurrence. We separately evaluated flanking 
sequence using both the difference between YRI/CEU map rates, and the differ- 
ence between the AE/S map rates, leading to the identification of the 17-bp 
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consensus African-enriched motif (Supplementary Note 6 has full details). To 
identify close matches to this 17-bp motif among all matches to the 9-bp motif in 
the genome, for every occurrence of the 9-bp motif, we scored the flanking 
sequence bases proportionately to the relative increase in average crossover rate 
difference associated with each base, then multiplied across bases in the 17-mer 
region to provide an overall score. We ranked occurrences according to this score, 
and plotted rates around the top 500 (Fig. 3b). We verified these findings by 
measuring average crossover differences for each base using only odd chromo- 
somes and used these to score motif occurrences on the (non-overlapping) set of 
even chromosomes, and vice versa (Supplementary Fig. 8). 

PRDM9 ZF length typing and genotyping of rs6889665. To determine the 
number of ZF motifs of PRDM9 in a subset of the samples used to build the 
map, published primer pairs* were used to amplify this region (forward: 
5'-GGCCAGAAAGTGAATCCAGG-3’, reverse: 5’-GGGGAATATAAGGGG 
TCAGC-3’). Product lengths ranged between 7 and 20 repeats (801-1,893 bp). 
Four of the 166 African-American samples did not show an amplification product, 
presumably because of insufficient DNA quality. We also genotyped 90 YRI and 90 
CEU HapMap samples. 

The SNP rs6889665 was genotyped in the same samples using an allelic dis- 
crimination assay (forward primer: 5’ -aaacttggaacatccatagggt-3’, reverse primer: 
5’ -cgaaaggagaaaagcataatcc-3’, Locked Nucleic Acid (LNA) probe ‘C’: 5'-/6-FAM/ 
aGGGatAaatgaag/BHQ/-3’, LNA-probe “T’: 5'-/HEX/ AGAGatAaatGaagg/ 
BHQ/-3'; LNA bases are given in capital letters). Reporter dyes: 6-FAM, 6- 
carboxyfluorescein; HEX, hexachlorofluorescein. Quencher: BHQ, Black Hole 
Quencher 1. Only one out of the 166 African-American samples failed in this 
assay. The same YRI and CEU samples as above were also genotyped. 
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Hydrogen is an energy source for 
hydrothermal vent symbioses 


Jillian M. Petersen, Frank U. Zielinski!**, Thomas Pape’, Richard Seifert*, Cristina Moraru', Rudolf Amann’, Stephane Hourdez?, 
Peter R. Girguis®, Scott D. Wankel’, Valerie Barbe’, Eric Pelletier’, Dennis Fink', Christian Borowski!, Wolfgang Bach® & Nicole Dubilier! 


The discovery of deep-sea hydrothermal vents in 1977 revolutionized our understanding of the energy sources that fuel 
primary productivity on Earth. Hydrothermal vent ecosystems are dominated by animals that live in symbiosis with 
chemosynthetic bacteria. So far, only two energy sources have been shown to power chemosynthetic symbioses: 
reduced sulphur compounds and methane. Using metagenome sequencing, single-gene fluorescence in situ 
hybridization, immunohistochemistry, shipboard incubations and in situ mass spectrometry, we show here that the 
symbionts of the hydrothermal vent mussel Bathymodiolus from the Mid-Atlantic Ridge use hydrogen to power primary 
production. In addition, we show that the symbionts of Bathymodiolus mussels from Pacific vents have hupL, the key 
gene for hydrogen oxidation. Furthermore, the symbionts of other vent animals such as the tubeworm Riftia pachyptila 
and the shrimp Rimicaris exoculata also have hupL. We propose that the ability to use hydrogen as an energy source is 
widespread in hydrothermal vent symbioses, particularly at sites where hydrogen is abundant. 


Deep-sea hydrothermal vents and their associated chemosynthetic 
communities were discovered in 19771. Since then, two energy sources 
have been shown to fuel primary production by the symbiotic bacteria 
that form the basis of the food chain in marine chemosynthetic eco- 
systems. In 1981, chemolithoautotrophic bacteria that use reduced 
sulphur compounds as an energy source were discovered in the 
gutless Riftia pachyptila tubeworms from hydrothermal vents in the 
Pacific>’. Five years later, the first symbionts that use methane as an 
energy source were discovered in mussels from hydrocarbon seeps in 
the Gulf of Mexico**. Since then, a vast array of chemosynthetic eco- 
systems has been explored, and novel symbioses of phylogenetically 
diverse hosts and symbionts are constantly being described®. Despite 
this, no other source of energy for metazoan chemosynthetic symbioses 
has yet been found. This is remarkable, given that many potential 
sources of energy for chemosynthesis are available, such as hydrogen, 
ammonium, ferrous iron and manganese(II)’, and free-living vent 
microbes able to use these energy sources are well known*”°. 

Some hydrothermal vents produce fluids with very high hydrogen 
concentrations due to the interaction of seawater with mantle-derived 
ultramafic rocks. Fluids originating from ultramafic-hosted vents are 
also characterized by high methane concentrations, whereas H,S con- 
centrations are rather low. In contrast, basalt-hosted vents produce 
fluids comparably high in H,S but low in H, and CH. The Logatchev 
vent field at 14° 45’ N on the Mid-Atlantic Ridge (MAR) is located ina 
ridge segment characterized by ultramafic outcrops (Supplementary 
Fig. 2). Fluids venting at this site have the highest hydrogen concen- 
trations ever measured in hydrothermal systems (19 mM in the end- 
member fluids)'’, which would provide a rich source of energy for 
chemosynthetic microbes. Hydrogen is a particularly favourable elec- 
tron donor as the energy yield from hydrogen oxidation is much 
higher than from methane oxidation, sulphur oxidation and all other 
potential electron donors for chemolithoautotrophic growth under 


standard conditions'*. In fact, our thermodynamic model predicts 
that at the Logatchev vent field, aerobic hydrogen oxidation could 
provide up to 7 times more energy per kilogram of vent fluid than 
methane oxidation, and up to 18 times more energy per kilogram of 
vent fluid than sulphide oxidation (Supplementary Fig. 1). 

At Logatchev, Bathymodiolus puteoserpentis mussels are by far the 
most abundant macrofauna’’. They live in a dual symbiosis with 
methane-oxidizing and chemoautotrophic sulphur-oxidizing bacteria 
that are hosted in their gills'*'°. Here we show that the sulphur- 
oxidizing symbiont of B. puteoserpentis uses hydrogen as an energy 
source. We also show that Bathymodiolus symbionts from basalt- 
hosted MAR vent fields can oxidize hydrogen. This ability is therefore 
not limited to the ultramafic-hosted vent fields and could also be 
powering chemosynthetic symbioses at basalt-hosted vent fields. 


Uptake hydrogenase genes in mussels 


The key enzymes involved in hydrogen metabolism are hydrogenases, 
which catalyse the reaction: Ho 2H* + 2e (ref. 17). Enzymes of 
the group 1 NiFe hydrogenases are membrane-bound respiratory 
enzymes that channel electrons from hydrogen into the quinone pool, 
providing the link between hydrogen oxidation and energy produc- 
tion'”'’. The large subunit of the membrane-bound uptake hydrogen- 
ase is encoded by the hupL gene. We amplified and sequenced this 
gene from symbiont-containing B. puteoserpentis gill tissues from 
Logatchev (Supplementary Fig. 3). The closest related sequence was 
the large subunit of the NiFe hydrogenase from the alphaproteobac- 
terium Oligotropha carboxidovorans (79.6% amino acid identity). O. 
carboxidovorans can grow chemolithoautotrophically under aerobic 
conditions using either CO or H2 as an electron donor’’. Phylogenetic 
analyses based on multiple treeing methods placed the enzyme from 
B. puteoserpentis in a well-supported cluster with other group 1 NiFe 
hydrogenases, showing the genetic potential for hydrogen oxidation 
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linked to energy generation in B. puteoserpentis endosymbionts. To 
examine whether the endosymbionts of mussels from hydrogen-poor 
habitats also have the genetic potential for hydrogen uptake, we tried 
to amplify the hupL gene from gill tissues of mussels from basalt- 
hosted vents and cold seeps that have fluids with low hydrogen con- 
centrations. Indeed, we could amplify the hupL gene from mussels 
from basalt-hosted vents, including undescribed Bathymodiolus 
mussels (Bathymodiolus spp.) from vents on the southern MAR 
(Wideawake at 4° 48'S, 5,600 km from Logatchev, and Lilliput at 
9° 33'S, 6,500 km from Logatchev), and B. aff. thermophilus from 
the Axial Dome vent on the Pacific-Antarctic Ridge at 37° 47'S 
(Supplementary Figs 2, 3 and Supplementary Table 7), showing that 
the genetic potential for hydrogen oxidation is not restricted to mussel 
symbionts from ultramafic-hosted vent fields that have high hydrogen 
concentrations. Intriguingly, we could not amplify this gene from any 
of the cold seep mussels investigated (see Supplementary Discussion). 


B. puteoserpentis symbionts use H» 

Given the genetic potential for hydrogen uptake in Bathymodiolus 
mussels, we incubated B. puteoserpentis mussel gill tissues from the 
ultramafic-hosted Logatchev vent field on the MAR with hydrogen at 
partial pressures of ~100p.p.m. in the headspace (0.08 1M H) dis- 
solved in the medium) and measured its consumption over time. 
Hydrogen was taken up rapidly by the symbiont-containing gill tissues 
ata rate of ~650 + 200nmolh ! (g wet weight) (n = 7). In contrast, 
symbiont-free foot tissue did not consume hydrogen at rates above the 
negative controls—boiled gill tissue and seawater (Fig. 1 and Sup- 
plementary Table 3). 


H, consumption 
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Because hydrogen uptake by bacteria is not necessarily coupled to 
CO, fixation’””°, we incubated mussel gill tissues with hydrogen in 
seawater containing *C-bicarbonate to determine whether hydrogen 
is an energy source for autotrophic CQ, fixation by B. puteoserpentis 
symbionts. Control gill tissues were incubated in the presence of 
sulphide or without an electron donor. '*C uptake was stimulated 
by sulphide, known to be an energy source for the sulphur-oxidizing 
symbionts of Bathymodiolus mussels’, and also by hydrogen (Fig. 1). 
The rates of carbon fixation with hydrogen were comparable to those 
supported by sulphide oxidation, which suggests that hydrogen could 
be fuelling autotrophy to the same extent as sulphide. We therefore 
conclude that hydrogen provides energy for the production of mussel 
biomass at the Logatchev hydrothermal vent field. 


Symbionts from basalt-hosted vents use Hp 


We examined whether the symbionts of Bathymodiolus mussels from 
basalt-hosted vent fields on the southern MAR at Comfortless Cove 
and Lilliput with low in situ hydrogen concentrations could also 
consume hydrogen. Hydrogen concentrations measured in discrete 
samples of the diffuse fluids from these vents were typically below 
0.1 uM (refs 22,23) as opposed to Logatchev, which had up to 154 uM, 
on the basis of discrete sampling (Supplementary Table 3). We incu- 
bated gill tissues of B. spp. mussels and measured hydrogen consump- 
tion over time as described earlier. Our results show that mussels from 
these vents were also able to take up hydrogen (Fig. 1 and Sup- 
plementary Table 3). However, the rates at which the symbiotic gill 
tissues of the southern MAR mussels consumed hydrogen were 20- to 
30-fold lower than those of mussels from Logatchev. 
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Figure 1 | Hydrogen consumption in Bathymodiolus gills. a, b, Hydrogen 
was consumed rapidly by B. puteoserpentis gills (Logatchev) and carbon 
fixation was stimulated after incubation with '“C-bicarbonate in the presence of 
hydrogen or sulphide. Each data point is one distinct measurement. 

c, Consumption rates of gills from two sites at Logatchev were in the same 


H, partial pressure (p.p.m.) 


H, partial pressure (p.p.m.) 


range. d, e, Rates from Logatchev and the southern MAR fields (B. spp.; 
Comfortless Cove, 4° 48’ S and Lilliput, 9° 33’ S) increased with increasing 
hydrogen partial pressures. Rates were roughly 20-30-fold higher at the 
hydrogen-rich Logatchev field than at the hydrogen-poor southern MAR vents. 
Error bars represent one standard deviation. 
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Effect of Hy concentration on consumption 


Some hydrogen-oxidizing microorganisms are known to induce 
hydrogenase expression only in the presence of hydrogen, and some 
express this enzyme constitutively at a low level, but increase its 
expression upon incubation with hydrogen™. To investigate the effect 
of dissolved hydrogen concentration on hydrogen consumption rates, 
we incubated gill tissues from the Logatchev and southern MAR 
venting sites at partial pressures of up to 3,000 p.p.m. (~2.3 UM dis- 
solved H2). Hydrogen consumption rates in mussels from Logatchev, 
Comfortless Cove and Lilliput increased by 135, 21 and 8 nmolh’' (g 
wet weight) ', respectively, for each 100 p.p.m. increase in partial 
pressure (Fig. 1). Thus, hydrogen uptake is clearly stimulated by 
increasing hydrogen concentrations. 


The sulphur-oxidizing symbiont uses H, 


Many phylogenetically diverse microorganisms can grow autotrophi- 
cally on hydrogen. Bathymodiolus mussels from the MAR host two 
types of chemosynthetic gammaproteobacteria: a sulphur oxidizer 
and a methane oxidizer’ (Fig. 2). In addition, these mussels have a 
gammaproteobacterial parasite that infects gill nuclei**. Hydrogen is 
known to provide energy for chemo- and photolithoautotrophic 


Figure 2 | The sulphur-oxidizing symbiont has the gene for hydrogen 
uptake, which it expresses. a, B. puteoserpentis with brown, symbiont- 
containing gills. b, Morphology of a single gill, each composed of many 
filaments (modified from ref. 48). c, FISH of a cross-section through a single gill 
filament (host cell nuclei, blue; sulphur-oxidizing symbiont, green; methane- 
oxidizing symbiont, red). d, e, hupL geneFISH (signals in green) with 16S rRNA 
FISH (signals in red) for the sulphur-oxidizing symbiont (d) and the methane- 
oxidizing symbiont (e). The negative control gave no gene signals 
(Supplementary Fig. 7). f, g, HupL immunohistochemistry (signals in green) 
with 16S rRNA FISH (signals in red) for the sulphur-oxidizing symbiont (f) and 
the methane-oxidizing symbiont (g). Yellow colours in d and f appear where 
green and red signals overlap. 
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growth of some free-living sulphur-oxidizing bacteria**’’. Some 


free-living methane-oxidizing bacteria can also oxidize hydrogen in 
addition to methane”. Lastly, hydrogen has been shown to be a key 
determinant of the virulence of pathogenic Helicobacter pylori”. 
Accordingly, it was unclear which of the three types of bacteria asso- 
ciated with Bathymodiolus mussels could be using the hydrogen. To 
investigate this further, we used three molecular methods allowing us 
to link identity with function at the DNA and protein level: genome 
sequencing of the mussel symbionts, single-gene fluorescence in situ 
hybridization (geneFISH) and immunohistochemistry, the latter two 
combined with 16S rRNA FISH. 

In the genomes of hydrogen-oxidizing bacteria, the genes for the 
large and small subunits of membrane-bound NiFe hydrogenases are 
often found clustered together with the genes necessary for bio- 
synthesis, maturation and processing'’. We used metagenomics to 
investigate hydrogen oxidation in the genome of Bathymodiolus 
symbionts from the southern MAR (Lilliput). We found a hydrogen- 
ase operon with 19 open reading frames on a 726-kb genome frag- 
ment (Supplementary Fig. 4). The large subunit gene from this 
genome fragment was 99.6% identical to the hupL gene amplified from 
the Wideawake (southern MAR) mussels using polymerase chain re- 
action (PCR), and 83% identical to the gene from B. puteoserpentis 
from Logatchev on the northern MAR. Homologues of all structural 
genes and those necessary for hydrogenase synthesis, assembly and 
function in the chemolithoautotrophic hydrogen oxidizer Cupriavidus 
necator” were present on our genome fragment. The southern MAR 
Bathymodiolus symbiont therefore has all of the genetic components 
required for hydrogen uptake. On the same genome fragment, we found 
the key genes for sulphur oxidation via the reverse dissimilatory sulphite 
reductase (rDsr) and Sox pathways, as well as for CO; fixation by the 
Calvin-Benson-Bassham (CBB) cycle (Supplementary Fig. 4), indi- 
cating that this fragment is from the sulphur-oxidizing symbiont of 
Bathymodiolus. 

To establish further that it is the sulphur-oxidizing symbiont that uses 
hydrogen, we used geneFISH, a method in which simultaneous detec- 
tion of single genes and rRNA enables the linking of function and 
identity at the single-cell level’. We detected the hupL gene in the 
symbiont-containing gill tissue of B. puteoserpentis (Fig. 2). GeneFISH 
signals from probes designed to target the hupL gene amplified by PCR 
from B. puteoserpentis overlapped with the 16S rRNA signals from the 
previously described sulphur-oxidizing symbiont (Fig. 2). This further 
suggests that the hupL gene we amplified is from the sulphur-oxidizing 
symbiont of B. puteoserpentis. 

To show that the sulphur-oxidizing symbiont expresses the uptake 
hydrogenase, we combined immunohistochemistry with FISH of the 
16S rRNA of the symbionts of B. puteoserpentis, thus linking identity 
and function at the level of gene expression. Using a polyclonal anti- 
serum against the membrane-bound hydrogen-uptake NiFe hydro- 
genase from C. necator (74% amino acid sequence identity to the 
B. puteoserpentis symbiont HupL; Supplementary Fig. 5), we were 
able to detect hydrogenase expression in single symbiont cells in 
B. puteoserpentis (Fig. 2). Signals from the anti-hydrogenase antibody 
were seen in gill bacteriocytes of B. puteoserpentis where they over- 
lapped with the FISH signals from the sulphur-oxidizing symbiont, 
but not with those from the methane-oxidizing symbiont. 


Environmental significance of Hy use 

One of the major challenges in hydrothermal vent research is to link 
physiological experiments, performed on board the research vessel at 
atmospheric pressure, with processes occurring under in situ temper- 
ature and hydrostatic pressure. We deployed an in situ mass spectro- 
meter (ISMS) to measure hydrogen concentrations in the mussel 
habitat at Logatchev, and compared these with temperature as a con- 
servative tracer for mixing of the hydrothermal fluids with seawater. 
Simultaneous measurements of hydrogen concentrations and tem- 
perature were made in two different settings; first, in an area of 
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focused flow, directly at the source where the fluids exit the seafloor. 
Here, the fluids had not been exposed to B. puteoserpentis mussels or 
other macrofauna. Second, we measured hydrogen concentrations 
and temperature in an area where the fluids had passed through a 
B. puteoserpentis mussel bed. The slope of the regression line calcu- 
lated for hydrogen concentration versus temperature was signifi- 
cantly lower in the mussel bed compared to the source fluid 
(P < 0.0001), indicating that fluids in the mussel bed are hydrogen- 
depleted compared to the source fluid (Fig. 3). This difference is 
probably due to hydrogen consumption by the sulphur-oxidizing 
symbionts of B. puteoserpentis. We observed the same effect with 
measurements of methane and temperature (P < 0.0001; Supplemen- 
tary Fig. 6). These measurements confirm that significant amounts of 
hydrogen are being consumed in Logatchev mussel beds. 

The terrestrial hydrogen biogeochemical cycle has been the topic of 
numerous studies (summarized in ref. 32), and interest in the global 
hydrogen cycle has recently grown with the prospect of a hydrogen 
economy’. In contrast to the terrestrial environment, there is a paucity 
of data on hydrogen turnover in the oceans. Conservative calculations 
based on our measurements of uptake rates in Logatchev mussels show 
that at 50 1M dissolved hydrogen a single mussel with a gill weight of 
5g could oxidize up to 435 mol Hh’ (Supplementary Informa- 
tion). On the basis of our own estimates and those of others, the mussel 
population at Logatchev spreads in layers over at least 200 m* and 
accounts for most of the invertebrate biomass’. One square meter is 
covered by approximately 1,100-2,500 adult individuals correspond- 
ing to a total population of 250,000 to 500,000 individuals. This popu- 
lation could consume between 0.5 and 1.0 mol H, m *h |, that is, in 
total up to 200 mol H,h '. Previous studies have reported hydrogen 
uptake rates in the range of several nmol ml 'h * in hydrothermal 
fluids**, and several nmolm *h ! in coastal waters*®. On the basis 
of the limited data available for comparison, the symbionts of 
Bathymodiolus mussels from the MAR probably have a role as a sig- 
nificant hydrogen sink at these hydrothermal vents. 

We do not know if the chemoautotrophic symbionts of vent mussels 
from other mid-ocean ridges are actively consuming hydrogen, 
although we show here that B. aff. thermophilus from the Pacific— 
Antarctic Ridge has the key gene necessary. Hydrogen is clearly present 
in the fluids at many basalt-hosted vents**”’, and could well be powering 
biomass production at these sites as well. In addition, use of hydrogen as 
an energy source may also have a role in other chemoautotrophic 
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Figure 3 | Hydrogen is consumed in mussel beds of B. puteoserpentis. Plot 
of hydrogen concentrations versus temperature, measured by in situ mass 
spectrometry in the source fluid and in a B. puteoserpentis mussel bed. The 
unbroken line shows linear regression analysis for the source fluid 

(y = 83.308x — 240, R* = 0.63), the broken line shows linear regression analysis 
for the mussel bed fluid (y = 43.576x — 120, R 0.56). The difference in the 
slopes of the regression lines between the source fluid and in the mussel bed is 
due to the consumption of hydrogen in the mussel bed. 
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symbioses; the episymbionts of the hydrothermal vent shrimp 
Rimicaris exoculata from MAR vent fields have hupL genes** (Sup- 
plementary Fig. 3), as does the endosymbiont of the giant tubeworm 
Riftia pachyptila from basalt-hosted vents in the East Pacific, which it 
expresses in situ (S. Markert, personal communication). This indicates 
that hydrogen use could be widespread in chemosynthetic symbioses. 


METHODS SUMMARY 


A detailed description of the sampling sites (Supplementary Tables 1 and 2) and 
all methods used in this study can be found in the Supplementary Information. 

For amplification of the hupL gene, we extracted DNA from gill samples” and 
used published primers**. Cloning and sequencing were done as described else- 
where*’. Maximum likelihood and parsimony HupL phylogenies were calculated 
in ARB” using a MAFFT® alignment. For incubation experiments, Bathymodiolus 
mussels were collected from hydrothermal vent fields on the MAR by remotely 
operated vehicles and incubated immediately after recovery on board the research 
vessel. Mussel gill and foot tissues were incubated in sterile-filtered seawater in glass 
serum vials (Supplementary Table 4). Hydrogen was added to the vials, and the 
change in headspace concentration was measured over time by gas chromatography. 
CO) fixation was measured by incubation of gill tissues with ‘*C-bicarbonate after 
adding H2, Na2S (for H2S/HS_ ), or no electron donor. The incorporation of labelled 
4C uptake over time was measured by liquid scintillation counting in separate tissue 
pieces from the same individual that were incubated for 0, 30, 60 and 120 min. 
Mussels for genome sequencing were collected from the Lilliput vent site on the 
MAR. Gill tissue was immediately homogenized and fixed on board. We enriched 
for symbiont cells by filtering sequentially through 8, 5, 3 and 2 um GTTP filters. We 
extracted DNA” and constructed a 6 kb paired-end library, which was sequenced 
by 454-Titanium and Illumina. Titanium reads were assembled using Newbler 
(v.2.3-PreRelease-10/19/2009-454; Life Sciences Corporation). One 726 kb scaf- 
fold was identified that contained the hydrogenase operon, and this scaffold was 
annotated by RAST“*. Polynucleotide probe design and geneFISH were done as 
previously described*’**, with modifications listed in Supplementary Information 
(Supplementary Tables 5 and 6). Immunohistochemistry was done by microwav- 
ing gill sections of B. puteoserpentis, blocking in western blocking reagent (Roche), 
incubating in primary antibody, then in HRP-conjugated secondary antibody. 
Signal amplification was done by CARD**. The ISMS was deployed as previously 
described” with modifications listed in Supplementary Information. Statistical 
analysis was done with the software JMP5. 
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Microwave quantum logic gates for trapped ions 


GC: Ospelkaus't, U, Warring’, Y. Colombe!, K. R. Brown!}, J. M. Amini‘, D. Leibfried! & D. J. Wineland!' 


Control over physical systems at the quantum level is important in 
fields as diverse as metrology, information processing, simulation 
and chemistry. For trapped atomic ions, the quantized motional 
and internal degrees of freedom can be coherently manipulated with 
laser light’’. Similar control is difficult to achieve with radio- 
frequency or microwave radiation: the essential coupling between 
internal degrees of freedom and motion requires significant field 
changes over the extent of the atoms’ motion””, but such changes are 
negligible at these frequencies for freely propagating fields. An 
exception is in the near field of microwave currents in structures 
smaller than the free-space wavelength*’, where stronger gradients 
can be generated. Here we first manipulate coherently (on time- 
scales of 20 nanoseconds) the internal quantum states of ions held 
in a microfabricated trap. The controlling magnetic fields are 
generated by microwave currents in electrodes that are integrated 
into the trap structure. We also generate entanglement between the 
internal degrees of freedom of two atoms with a gate operation*** 
suitable for general quantum computation’; the entangled state has 
a fidelity of 0.76(3), where the uncertainty denotes standard error of 
the mean. Our approach, which involves integrating the quantum 
control mechanism into the trapping device in a scalable manner, 
could be applied to quantum information processing*, simu- 
lation®’° and spectroscopy””. 

The quantized mechanical motion and internal states of trapped 
atoms can be coherently controlled by laser radiation. This optical 
control has led to the creation of non-classical motional states’, 
multiparticle entanglement’, scalable quantum logic techniques’’, 
quantum simulation studies'*’° and quantum logic spectroscopy". 
The coupling between internal and motional states is provided by 
the gradient of the driving field over the extent, Ax, of the atoms’ 
motion’. For trapped atomic ions in their lowest motional quantum 
levels, Ax~10 nm, which is a significant fraction of an optical wave- 
length, resulting in an efficient coupling between motional and 
internal states. A microwave field propagating in free space varies only 
very slightly over Ax, and the corresponding coupling between 
motional and internal states is highly suppressed. 

Notwithstanding the success of laser-based control, it is desirable to 
develop a microwave approach with equivalent capabilities because of 
the relative ease of generating and controlling these fields. One method 
uses a combination of uniform microwave fields and static, state- 
dependent potentials to provide motional-internal state coupling’*”” 
and individual addressing’*”°. Related to this are magnetic microwave 
or radio-frequency-dressed state potentials for neutral atoms*’. Here 
we implement the approach outlined in ref. 4, where the magnetic field 
from oscillating microwave currents in trap electrodes provides coher- 
ent manipulation of the internal state, and the gradient of this field leads 
to motional-internal state coupling. These interactions suffice for 
universal quantum information processing, because any unitary 
operation on quantum bits (qubits) can be decomposed into a sequence 
of single-qubit rotations and a suitable entangling interaction’. By 
comparison with laser-based approaches, microwave control has the 
potential to improve significantly the fidelity of operations owing to 
its reduced sensitivity to ‘spectator modes’ of ion motion’, its 


better control of field amplitudes and phases, and the absence of 
spontaneous-emission decoherence*’®. Furthermore, microwave 
near-field control could be incorporated in a chip-level library of trans- 
port, junction, storage and microwave manipulation components” to 
advance the integration of quantum control in scalable quantum 
information processing or simulation. 

Our apparatus comprises a room-temperature (300K) surface- 
electrode ion trap”’ with 10-1m-thick gold electrodes, separated by gaps 
of 4.5 um, deposited onto an insulating AIN substrate (Fig. 1). An oscil- 
lating potential (amplitude, 25-50 V; frequency, fr = 70.97 MHz), 
applied to the radio-frequency electrodes (Fig. 1, RF1 and RF2), pro- 
vides pseudopotential confinement of "Mg" ions in the radial (x and 
z) directions at a distance of 30 tm from the electrode surface (all other 
electrodes are held at radio-frequency ground). Along the trap y axis, 
ions are confined with static potentials applied to control electrodes 
C1, C3, C4 and C6. The radial field resulting from these potentials is 
compensated for by suitable potentials applied to electrodes C2 and C5. 
Single-ion oscillation frequencies along the y direction can be adjusted 
between 300 kHz and 2 MHz, and radial frequencies can be adjusted 
between 4 and 10 MHz. Microwave electrodes MW1, MW2 and MW3 
support currents, of order 100 mA to 1 A, that produce an oscillating 
magnetic field, B,,.. above the surface and are used to implement 
quantum control. To minimize thermal effects during these microwave 
pulses, we chose AIN as the electrode substrate. It provides a strong 
thermal link between the gold electrodes and a solid copper support 
through a thin layer of vacuum-compatible epoxy. A static magnetic 
field, Bp, parallel to the trap surface and at an angle of 15° with 
respect to the z axis, provides the internal-state quantization axis. 
Superimposed circularly (o )-polarized laser beams nearly resonant 


Figure 1 | Central portion of the surface-electrode trap. The control 
electrodes are labelled C1 to C6, trap radio-frequency electrodes are labelled 
RF1 and RF2, and microwave lines are labelled MW1 to MW3. The ions are 
held a distance d = 30 um above the surface. By comparison with the 
symmetric trap structure of ref. 4, the asymmetric geometry of the microwave 
and radio-frequency electrodes used here yields approximately equal 
microwave currents in all three electrodes for field suppression at the ions while 
providing a deeper trap. 


Time and Frequency Division, National Institute of Standards and Technology, 325 Broadway, Boulder, Colorado 80305, USA. +Present addresses: Institute of Quantum Optics, Leibniz Universitat 
Hannover, Welfengarten 1, 30167 Hannover, and PTB, Bundesallee 100, 38116 Braunschweig, Germany (C.O.); GTRI Georgia Tech, 400 10th Street NW, Atlanta, Georgia 30318, USA (K.R.B., J.M.A.). 
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with 37S,» 3°P3/2 transitions and propagating parallel to By are used 
for optical pumping, Doppler laser cooling and state detection through 
resonance fluorescence. Optical pumping prepares ions in the |F = 3, 
Mp = —3) hyperfine ground state. Using microwave pulses resonant 
with hyperfine transitions, we can prepare an arbitrary pure state 
within the ground-state manifold (Fig. 2a) with fidelity greater than 
0.98 (Methods). Furthermore, we can detect the population in any one 
of these states by applying a series of microwave 1-pulses to transfer its 
population to |3, —3) and then detecting resonance fluorescence on the 
|3, —3)— |3?P3/2, F = 4, mp = —4) optical cycling transition. 

At a bias field of Bp~21.3 mT, we realize a first-order magnetic- 
field-independent transition between the qubit states |3, 1) =| |) and 
|2, 1) =|1) with ‘carrier’ frequency fy~1.69 GHz (Fig. 2a, solid line). 
To prepare ||), after optically pumping the ions to |3, —3), we apply 
four microwave 1-pulses (Fig. 2a, dotted lines). Figure 2b shows 
Ramsey spectroscopy measurements of the ||)—> |) transition fre- 
quency as a function of magnetic field detuning, 5B, from Bp. At 
5B = 0, the transition frequency depends on 8B only to second order, 
a feature that has been shown to allow long coherence times (~ 10s) 
despite moderate ambient magnetic field noise”. 

Rabi flopping on the ||)—>|{) transition with a complete state 
exchange time (1-time) of 18.63(3) ns (Fig. 2c) demonstrates the speed 
of single-qubit operations. This is more than 200 times shorter than a 
typical m-time (4|1s) obtained with continuous-wave Raman laser 
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Figure 2 | Level scheme of ?°Mg* (nuclear spin, I = 5/2) and spectroscopy of 
the qubit transition. a, Ground-state hyperfine structure of "Mg" at 
Bo~21.3 mT and the microwave transitions used in the experiment. h, Planck’s 
constant. b, Frequency of the ||) <> |) transition as a function of bias-field 
detuning, 5B, from the field-independent point, Bo. The standard errors of the 
measured frequencies are less than 5 Hz. c, Rabi flopping on the ||) <> |T) 
transition induced by microwave current in MW2, the electrode in closest 
proximity to the ions. Error bars, s.e.m. for the average photon count in 400 ils. 
Tq» 1-time. 
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beams”. Moreover, it is nine orders of magnitude shorter than the 
coherence time achieved on a similar field-independent transition”. 
(For very short m-pulses realized with pulsed lasers, see ref. 26.) 

To achieve coupling between motional and internal states, we apply 
sufficiently large oscillating magnetic field gradients at frequencies 
Ss™fo +f near the motional sidebands of the qubit transition carrier 
frequency, fo (where f, is a radial motional-mode frequency). At the 
same time, we want to suppress B,,. at the ions because it can cause 
undesirable off-resonant carrier excitation and a.c. Zeeman shifts, 
analogous to a.c. Stark shifts for optical fields*. In the geometry of 
Fig. 1, there is a single combination of relative current amplitudes 
and phases in electrodes MW1, MW2 and MW3 that provides an 
oscillating magnetic field gradient at the ions without an oscillating 
magnetic field. 

We find this combination by adjusting the amplitudes and phases of 
the currents to minimize the a.c. Zeeman shifts on the qubit and 
neighbouring hyperfine transitions. To characterize the resulting field, 
we map the x-z spatial dependence of the a.c. Zeeman shift on a 
selected magnetic dipole hyperfine transition by displacing the ion 
radially from its nominal position with adjustments of the static trap 
potentials. Within a small region around the oscillating field zero, we 
expect Bos. to be a quadrupole characterized by a gradient strength, B’, 
and an angle, « (the angle of one principal axis of the quadrupole with 
respect to the x axis; see Methods). A fit based on this model and to the 
experimental a.c. Zeeman shift data yields B’ = 35.3(4) Tm ! and 
& = 26.6(7)°. For comparison, a plane-wave microwave field with 
the amplitude required to produce the m-time of Fig. 2c (1.9 mT), 
has a gradient of only 0.068Tm ’. Knowledge of the orientation 
and gradient of the field can be used to predict sideband Rabi rates 
and to optimize the overlap with one of the radial motional modes. 
From the measured gradient and the assumption of optimal alignment 
of the mode with the gradient, we calculate a sideband 1-time* of 
approximately 190 us at a motional frequency of 6.5 MHz for a single 
ion in its motional ground state (Methods). 

After approximately optimizing the field configuration, we imple- 
mented motional sideband transitions on one ion and ona pair of ions. 
As an example of a two-ion sideband application, we perform the 
following experiments on the selected radial rocking mode. First, 
two ions are Doppler-cooled and prepared in the ||) state. We then 
apply a qubit m-pulse followed by a sideband pulse at f, = fo + f; (here 
jf: 6.8MHz for the chosen rocking mode). The sideband pulse 
induces transitions from |||,m) to |{|,n+1), |l7,n+1) and 
|?1,2 +2), where n denotes the motional quantum number”. 
Populations in | |) are subsequently transferred to |3, —3) and detected 
by laser fluorescence. In Fig. 3a (blue circles), we show the result of a 
frequency scan of the sideband pulse. We then repeat the above experi- 
ment, but swap the order of the sideband pulse and the carrier m-pulse 
(Fig. 3a, red squares). In this case, the sideband pulse drives transitions 
from |}J,) to || },2— 1), |Tl,n — 1) and |||, — 2). If we assume a 
thermal motional state (valid for Doppler cooling), a fit to the data 
gives an average occupation number of i =2.2(1). 

Multiple sideband cooling cycles can be used to cool a selected 
motional mode to near the ground state””’”. One sideband cooling cycle 
consists of transitions from |{7,n) to ||?,2—1), |TJ,n—1) and 
|| |, — 2), where the pulse duration (250 1s) is optimized for maximal 
sideband asymmetry. This is followed by a repumping laser pulse 
resonant with the ||)— |3°Ps/2., my = —3/2, m,= 3/2) transition 
(where my, and m; are the components of the total electronic and 
nuclear angular momenta parallel to the static magnetic field, Bo). 
From the relevant Clebsch-Gordan coefficients, we predict a 97% 
repumping efficiency to |{) after scattering 20 photons on average. 
Figure 3b shows a frequency scan of the motion-adding and 
motion-subtracting sidebands after four sideband cooling cycles. 
Assuming an approximately thermal distribution after cooling”, we 
find that n =0.6(1), confirming the sideband cooling. Lower tempera- 
tures could not be achieved by adding more cooling cycles, and the final 
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Figure 3 | a motional sideband transitions. a, Motion-subtracting 
Itt a {|tLa- — 2)} (red squares) and motion- 
adding || |,n)> i 
transitions for a Doppler-cooled two-ion radial rocking mode. b, Sideband 
transitions after four sideband cooling cycles. Error bars, s.e.m. for the average 
photon count in 400 hs. 


value of i was most probably limited by the repumping photon recoil 
and by heating from ambient electric field noise” during the course of 
the sequence. Separately, we measured the relevant motional heating 
rate and found it to be as low as di/dt =0.2 ms~! at St = 7.7 MHz; 
however, higher heating rates were also observed. We did not detect 
additional heating induced by the microwave pulses. 

Simultaneous application of two field gradients at fy + (f, + 6), with 
o<f;, is used to implement an entangling two-qubit gate*. By com- 
parison with light fields, derivatives of the microwave fields of order 
higher than linear have negligible effect on the ions’, so the ions are 
only Doppler-cooled and prepared in the || |) state. We then apply the 
two oscillating field gradients with f, = 7.6 MHz (rocking mode) and 
6 = 4.9 kHz. To suppress decoherence caused by motional frequency 
instability, we apply the fields in two pulses of duration 200 1s, revers- 
ing the phase of one field in the second pulse**. For properly chosen 
pulse durations, the internal states ideally end in the entangled state 
|) =(1/V/2)(\\1) —itT)), with the motion restored to the state it 
was in before the gate operation’®. Pulse imperfections and decoherence 
during the operation produce a mixed state with density matrix p (after 
tracing over the motion). We characterize the fidelity of p with respect 
to the ideal outcome, F, = (Y|p|¥) = (1/2)(prp47 + Pri) + [Pree 
by observing population oscillations as we sweep the phase, ¢, of a 1/2- 
pulse applied to the qubits after creating the entangled state’. The 
populations p)) 1), Prt) + Plrit and p74,77 can be directly deter- 
mined by fitting histograms for two, one or zero fluorescing ions to 
the histograms of the total fluorescence signal obtained during the 
sweep (Methods). The resulting populations are shown in Fig. 4a. 
The magnitude of the density matrix element p;;,), is equal to the 
amplitude, Ay, of the parity I= (P\y1) + Pijyy) — (Pity + Pip), 
which oscillates like A ;7cos(2 + 9) (ref. 29; Fig. 4b). By fitting sinu- 
soids to the observed populations, we extract a fidelity of F, = 0.76(3) 
with respect to the state | ¥). 
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Figure 4 | Populations and parity of the entangled state. After the 
entanglement operation is applied to the I state, an analysis 1/2-pulse with 
relative phase ¢ is applied. Ideally, 
superpositions of | Y%) and (1/ 2)(( TL +N), anid parity and populations of 
the two ions oscillate at cos(2p + $9). a, Populations in | | 7) (P) ;;), red disks), 
[L.1) (P) 4. blue tiamoees) and |{ |) rs |Lt) Pip +P} y green squares). 
b, Parity T= (P\ pj) + Py \y) — (Pi py + Pi 7p). Each data point represents an 
average of 300 experiments. We determine an entangled-state fidelity of 

F, = 0.76(3) from the populations in a. Error bars, s.e.m. for the populations 
and parity. 


For scalable implementations of quantum information processing, 
the quality of all operations demonstrated here, in particular that of the 
entangling operation, must be improved considerably. This will 
require significant technical improvements; however, there are no 
apparent fundamental limits to fidelity. The most important factors 
currently limiting two-qubit gate fidelity are the stability of motional 
frequencies in the trap and the precision with which we can suppress 
Bosc at the positions of the ions. Fluctuations of the motional frequencies 
are of the order of 1 kHz during the gate, which is a sizeable fraction of 
the gate detuning, 6, and seem to be primarily caused by time-varying 
stray potentials on the trap chip. It should be possible to suppress such 
stray fields by improving the surface quality of the electrodes, by redu- 
cing the amount of nearby dielectric material (dust and the residue of 
materials used during fabrication), by more carefully cleaning the elec- 
trode surfaces and by minimizing the exposure of the electrodes to 
ultraviolet light, which can generate charge through photoemission. 
Weare able to suppress the oscillating field at the positions of the ions 
to fluctuations with amplitudes of approximately 10 uT, implying an a.c. 
Zeeman shift of less than 1 kHz. These fluctuations could be further 
suppressed by improving the amplitude stability and shaping of the 
microwave pulses and by using more sophisticated microwave current 
leads. 

The presence of many processing zones on the same chip can intro- 
duce cross-talk between zones. Cross-talk for two-qubit gates can be 
suppressed by choosing substantially different mode frequencies for 
qubits in nearby ‘spectator’ trap zones. For single-qubit rotations, it 
should be possible to reduce the field to near zero in spectator zones by 
applying local compensating fields. Composite pulse techniques can 
also prove useful in enhancing spectator-zone isolation*’. Traps smaller 
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than those used here should lead to faster operations’, provided that 
anomalous motional heating can be suppressed. Multi-ion operations 
similar to those demonstrated here might also be used for quantum 
simulation™”®, possibly with less stringent requirements on suppression 
of cross-talk. For particles lacking the electronic structure needed for 
laser manipulation and read-out, microwave sidebands would allow 
excitation of the motion conditioned on the internal-state magnetic 
moment*’. The presence of the motional excitation could then be 
observed with an ancilla ion, either through a shared motional mode"! 
or through the Coulomb interaction between ions held in separate 
traps**'~**. In the second case, this technique might allow a comparison 
of the proton and antiproton magnetic moments*”’. 


METHODS SUMMARY 


We describe the oscillating magnetic field geometry in the vicinity of the ions, and 
the experimental determination of this geometry, by mapping out a.c. Zeeman 
shifts experienced by the ions as functions of their positions. See the Methods for 
more details. There we also explain how state populations are extracted from 
fluorescence measurements to derive the fidelity of the entangled state produced 
by the two-qubit gate. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Determination of the local microwave field. Within a small region {5x, dy, 5z} 
around the oscillating microwave field null point, we expect Bos. to be a quadrupole 


cos(2a) 0 — sin(2a) Ox 
Bosc = B’ cos (2f,t) 0 0 0 oy 
sin(2x) 0 — cos(2a) Oz 


in the coordinate system of Fig. 1, where B’ and a (the angle of one principal axis of 
the quadrupole with respect to the x direction in Fig. 1) characterize the strength 
and orientation of the quadrupole, respectively. We map the x-z spatial depend- 
ence of the a.c. Zeeman shift on the magnetic dipole transition |3, 1) |2,0) by 
displacing the ion radially from its nominal position with adjustments of the 
control electrode potentials that are derived from simulations. We determine 
the a.c. Zeeman shifts for a set of radial displacements {6x, dz} and find that 
B’ = 35.3(4)Tm !anda = 26.6(7)° froma fit to this data. Asa consistency check, 
we probe the relevant one-ion sideband with frequency near 6.5 MHz and find a 
m-time for the ion in its motional ground state of approximately 260 [1s (extracted 
by a model fit). The discrepancy of 27% between the observed 1-time and that 
inferred from the measured gradient is most probably due to a misalignment 
between the maximal gradient direction and the axis of the interrogated motional 
mode. (To maintain a reasonable trap depth, the motional mode could not be 
perfectly overlapped with the maximal gradient.) 

Determination of populations from state-dependent fluorescence. During one 
detection period (duration, 100 1s), we typically detect ~0.3 counts on average if 
both ions are projected into |{), and approximately 11 additional counts for each 
ion in state ||). 
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Reference histograms are derived by recording photon counts for an on- 
resonance Ramsey experiment with two ions where the phase, 0, of the second 
m/2-pulse was varied. We establish in independent experiments that the net effect 
of transferring in and out of the qubit basis, and the Ramsey pulses, gives popu- 
lation errors not exceeding 0.02. We therefore assume the respective probabilities 
of observing zero, one or two fluorescing ions to be 


Po(0) =cos*(0/2) 
P,(0) =sin?(0)/2 
P,(0) =sin*(0/2) 


We model each histogram for two, one or zero fluorescing ions as a sum of three 
Poisson distributions with variable weights and average counts. The weights and 
averages are determined by a simultaneous least-squares fit to the experimentally 
determined histograms for all phases 0. To reduce the number of free parameters, 
we fix the two-ion average to be twice the one-ion average. With this choice, the fit 
contains eight free parameters and has a reduced 7” value of 1.34. Using the histograms 
representing two, one and zero fluorescing ions, we do a second least-squares fit to the 
histograms obtained while scanning the phase, ¢, of the 1/2-pulse following the 
entangling gate. Each population is fitted to functions a,cos(2 + ¢) + do, where 
a, and do, (k = 0, 1 or 2 fluorescing ions) are pairs of constants and #o is a common 
phase offset. The least-squares fits are shown in Fig. 4a. These fits directly yield the 
parity amplitude, Az, the phase offset, po, and the offset from zero shown in Fig. 4b. 

The final uncertainty in the fidelity, F,, is derived from the uncertainties in the 
population fits. In addition to the data analysis described above, we apply several 
other parameterizations of the histograms to fit to the Ramsey reference experi- 
ment. In all instances, the resulting gate fidelities agree with our stated result to 
within the respective uncertainties. 
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Quantum gates and memory using 


microwave-dressed states 


N. Timoney’, I. Baumgart', M. Johanning’, A. F. Varon', M. B. Plenio, A. Retzker? & Ch. Wunderlich’ 


Trapped atomic ions have been used successfully to demonstrate’ 
basic elements of universal quantum information processing. 
Nevertheless, scaling up such methods to achieve large-scale, universal 
quantum information processing (or more specialized quantum 
simulations” *) remains challenging. The use of easily controllable 
and stable microwave sources, rather than complex laser systems*’, 
could remove obstacles to scalability. However, the microwave 
approach has drawbacks: it involves the use of magnetic-field- 
sensitive states, which shorten coherence times considerably, and 
requires large, stable magnetic field gradients. Here we show how to 
overcome both problems by using stationary atomic quantum states 
as qubits that are induced by microwave fields (that is, by dressing 
magnetic-field-sensitive states with microwave fields). This permits 
fast quantum logic, even in the presence of a small (effective) Lamb- 
Dicke parameter (and, therefore, moderate magnetic field gradients). 
We experimentally demonstrate the basic building blocks of this 
scheme, showing that the dressed states are long lived and that coher- 
ence times are increased by more than two orders of magnitude rela- 
tive to those of bare magnetic-field-sensitive states. This improves the 
prospects of microwave-driven ion trap quantum information pro- 
cessing, and offers a route to extending coherence times in all systems 
that suffer from magnetic noise, such as neutral atoms, nitrogen- 
vacancy centres, quantum dots or circuit quantum electrodynamic 
systems. 

Using laser light for coherent manipulation of quantum bits (qubits) 
gives rise to fundamental issues, notably unavoidable spontaneous 
emission that destroys quantum coherence*’. The difficulty of cooling 
a collection of ions to their motional ground state and the long dura- 
tion of this process in the presence of spurious heating of Coulomb 
crystals limits the fidelity of quantum logic operations in laser-based 
quantum gates and thus hampers scalability. This limitation is only 
partly removed by the use of ‘hot’ gates’®””. Technical challenges in 
accurately controlling the frequency and intensity of laser light as well 
as in targeting a large number of laser beams of high intensity on 
trapped ions are further obstacles to scalability. 

These issues associated with the use of laser light for scalable QIP 
have led to the development of novel ways of performing conditional 
quantum dynamics with trapped ions that rely on radio-frequency or 
microwave radiation instead of laser light®”!”"'*. Radio-frequency or 
microwave radiation can be used in quantum gates through the use of 
magnetic-gradient-induced coupling between spin states of ions'®, 
thus averting the technical and fundamental issues of scalability that 
were described above. Furthermore, the sensitivity to motional excita- 
tion of ions is reduced in such schemes. A drawback of magnetic- 
gradient-induced coupling is the necessity of using magnetic-field- 
sensitive states for conditional quantum dynamics, thus making qubits 
susceptible to ambient field noise and shortening their coherence 
times. This issue is shared with some optical ion trap schemes for 
QIP that usually rely on magnetic-field-sensitive states for conditional 
quantum dynamics, such as geometric gates’, limiting the coherence 
time of qubit states typically to a few milliseconds. In an effort to 
extend the coherence times of atomic states, two-qubit entangled states 


forming a decoherence-free subspace have been created’”"’. Recently, 
transfer between field-sensitive states used for conditional quantum 
dynamics and field-insensitive states used for storage of quantum 
information has been utilized”’. 

The relevant noise source in this case, namely magnetic field fluc- 
tuations, is not featureless white noise but tends to have a limited 
bandwidth. In this context, techniques were proposed for prolonging 
coherence times by subjecting the system to a rapid succession of 
pulses leading to a decoupling from the environment. This technique, 
termed bang-bang control’, and its continuous version” can be 
applied usefully in a variety of systems including hybrid atomic physics 
and nanophysics technologies. Recent work includes the experimental 
demonstration of optimized pulse sequences made for suppression of 
qubit decoherence (see refs 22, 23 and references therein). 

Here we encode qubits in microwave-dressed states requiring only 
continuous, constant-intensity microwave fields. This scheme protects 
qubits from magnetic field fluctuations and, importantly, simultaneously 
allows fast quantum logic even for small Lamb-Dicke parameters and, 
therefore, moderate magnetic field gradients. Microwave-generating 
elements for the coherent manipulation of qubits can be integrated into 
microstructured ion traps’ such that QIP can be realized using scalable 
ion chips. Thus, this novel scheme is a significant step towards the 
integration of elements required for QIP on a scalable ion chip. 
Moreover, the ideas presented here are generic and can be applied to 
all laser- or microwave-based QIP. 

We theoretically describe the scheme for storage and single- and 
multiqubit quantum gate operation, and then present experimental 
demonstrations of storage and information processing of quantum 
information that demonstrate gains of more than two orders of mag- 
nitude in coherence times over approaches involving bare states. 

We consider a typical energy-level configuration (Fig. 1a). Encoding 
quantum information in the subspace spanned by two magnetically 
sensitive mp = +1 states, |—1) and |+1), will lead to a rapid loss of 
coherence owing to fluctuating magnetic fields. We first note that 
microwave-dressed states create a subspace spanned by the two states 
|D) =(|—1) —|+1))/V2 (a dark state) and |0’) (mp = 0) of the same 
manifold. This subspace is separated from the other eigenstates of 
the system, |u) and |d), by a finite energy gap (the dressing fields’ 
Rabi frequency is denoted by Q, and |A_ — 4,|<Q should hold 
for the detunings 4_ and A; refs 24, 25). The energetically degenerate 
states |D) and |0’) are not coupled by magnetic field fluctuations. As a 
consequence of the energy gap between these states and the states |) 
and |d), a relative phase change between |—1) and |+1) acquires an 
energy penalty and dephasing is strongly suppressed as long as the 
spectral power density of the magnetic field fluctuations at the fre- 
quency corresponding to the energy gap is negligible. We note that the 
levels |D) and |0') are energetically degenerate independent of the 
applied microwave Rabi frequency, 2, and are hence stable against 
its power fluctuations. Therefore, we consider the qubit encoded in 
the subspace spanned by |D) and |0’). The microwave fields do not 
couple any states in the qubit subspace and thus do not limit the phase 
coherence. 
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Figure 1 | Microwave-dressed qubit states. Left, atomic states and 
microwave-dressing fields with Rabi frequency Q. |—1) and |+1) represent 
magnetic-field-sensitive states, and have, for instance, mp = +1, and |0) and 
|0’) represent magnetically insensitive levels (and are, for instance, the mp = 0 
states of two different hyperfine manifolds). Right, dressed states. 

|D) =(|—1) —|+1))/V2 and |0’) are separated by an energy gap from two 
other states, |v) = (|B) +|0)) //2 and |d) = (|B) —|0)) //2, where 

|B) =(|—1) +|+1))/V2. Thus, a qubit with states |D) and |0’) is created that 
is resilient to magnetic field fluctuations and to microwave power fluctuations. 
a, A radio-frequency field with Rabi frequency Q, is used to implement general 
single-qubit quantum gates. b, Conditional quantum dynamics coupling 
electronic degrees of freedom to the motion of trapped atoms is achieved when 
the radio-frequency field is detuned from the qubit’s carrier transition 
frequency by the vibrational frequency, v. 


The nature of the protected subspace depends on the relative phase 
between the microwave fields on the transitions |0) <> |+1). The above 
description applies for a relative phase of zero, whereas for a relative phase 
of x the protected subspace is spanned by |B) =(|—1) +|+1))/V2 
and |0’) (Methods). We considered both cases in the experiments 
reported below. 

The application of one or two additional radio-frequency fields with 
Rabi frequency Q,, possible detuning 4, and a possible initial relative 
phase of m (Methods) then permits the implementation of general 
single-qubit rotations in the protected subspace spanned by |B) and 
|0") in the case of one additional field and by |D) and |0’) in the case of 
two additional fields. Choosing 4,=0 yields the Hamiltonian 
H= Q,{|D)0'| + |0")D|), which in a Bloch-sphere picture describes 
arbitrary rotations about the x axis in the protected space. A rotation 
about the z axis is obtained for 4, ~ 0 and 2, # 0 under the conditions 
Q, <A 7 andv<A 7 where v is the vibrational mode frequency of the 
harmonically trapped ions’ motion. The specific case in which A, = v, 
that is, tuning to the motional sideband, will be discussed later as it 
corresponds to coupling of the electronic degree of freedom to the 
motional degree of freedom. 

Hyperfine states of electrodynamically trapped ‘7'Yb™ ions char- 
acterized by quantum numbers F=0 and F=1 in the electronic 
ground state S;/2 are used in the experiments reported here (Methods). 

After the initial preparation of an ion in the |0) (F= 0) state by 
optical pumping, the procedure to generate and detect dressed states 
can be thought of as being divided into three segments (Fig. 2, inset). 
First (up to time t = T,), an incomplete stimulated Raman adiabatic 
passage** (STIRAP) is used to transfer the system adiabatically from 
the atomic-state basis to the dressed-state basis. Second (up to t = T>), 
the amplitudes of the dressing fields are kept constant for a holding 
time, T = T, — T;, because we are interested in creating and using 
dressed states (as opposed to population transfer). Between t = T; 
and t= T>, dressed states are present (Fig. la, b, right-hand side) 
and quantum operations with them are implemented through applica- 
tion of radio-frequency fields. For a single-qubit gate between dressed 
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Figure 2 | Lifetime of the dressed state |D). To prepare the dressed state |D), 
the dressing fields’ amplitudes are ramped up adiabatically until time T,. Then 
these amplitudes are kept constant until time T>. Between T, and T>, the dressed 
states shown in Fig. 1 (right-hand side) are present. Finally, after time T>, the 
STIRAP sequence is completed, adiabatically transferring the dressed state | D) 
back to a bare ionic state. The hold time is T = T, — T}. If state |D) stays 
invariant during time T, then the final ionic state will be |0). The data points 
indicate the probability to find the ion in state |0) upon completion of this 
sequence as a function of T. The solid line represents a fitted exponential giving 
a lifetime of 1,700 + 300 ms. Number of repetitions, n = 300; error bars, s.d. 
Experimental parameters are given in Supplementary Information. Inset, 
qualitative time dependence of the two dressing fields’ amplitudes (black and 
blue; set on resonance with the atomic transitions |—1) <> |0) and | +1) |0)). 


states (Fig. la, right-hand side) the radio-frequency field is on res- 
onance with the |0’)<>|+1) transition, and for a multiqubit gate 
(Fig. 1b) the radio frequency is detuned from this transition by the 
vibrational mode frequency, v. Third (t > Ty), the STIRAP sequence is 
completed when the ion is transferred from the dressed-state basis 
back to the atomic states. Any dephasing of a dressed state or transi- 
tions to other states during the holding time, T, gives rise to imperfect 
population transfer during step three of this sequence that returns the 
system to the atomic-state basis. This is described in more detail in 
Methods for the creation and detection of the state |D). 

In Fig. 2, we show the effectiveness of preparation and detection of 
|D) as a function of the holding time, T. The measurement shown there 
extends over 500 ms and an exponential fit of the data yields a lifetime 
for this state of 1,700 + 300 ms. This lifetime is limited by magnetic 
field fluctuations that couple |D) to other states and the creation of a 
different dark state through phase fluctuations between the two micro- 
wave fields. This represents a remarkable improvement, of more than 
two orders of magnitude, in the dependence on the magnetic fluctua- 
tions, as the dephasing time of the magnetically sensitive bare atomic 
states |+1) in this apparatus has a measured coherence time not 
exceeding 5.3 ms. 

We have investigated the effectiveness of the STIRAP process for 
creating dressed states and transferring them into the final state, as a 
function of the parameters characterizing the pulse sequence, and have 
found this technique to be robust over a wide range of experimental 
parameters (Methods). 

To demonstrate the enhanced coherence time of this qubit, we have 
conducted Rabi- and Ramsey-type measurements. First, the system is 
transferred to the dressed-state basis and then an additional radio- 
frequency field with Rabi frequency Q, is applied that induces Rabi 
oscillations between the dressed states |B) and |0’). Using the state |B) 
has the advantage that no phase of m between the radio-frequency 
fields is needed (compare with Fig. 1). Completing the STIRAP cycle 
then maps the system to an atomic state that depends on the position 
of the atom in the dressed-state Rabi cycle. Results are shown in Fig. 3a. 
The Rabi oscillations are sustained over 550 ms, demonstrating the 
long-lived coherence of the dressed states when driven by radio- 
frequency radiation. 

These experiments also demonstrate that coherent transfer to the 
dressed-state basis and subsequent application ofa Rabi pulse prepares 
a coherent superposition in the qubit space that can then be read out 
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Figure 3 | Single-qubit gates with dressed states. a, Rabi oscillations between 
dressed state |B) and |0’), induced by a radio-frequency field for times of up to 
more than 500 ms. The population in |B) is mapped onto state |0) (data points) 
at the end of the STIRAP and detection sequence. n = 50 (times up to 40 ms) 
and n = 25 (times of more than 500 ms); error bars, s.d. b, Ramsey-type 
measurement preparing a coherent superposition of |B) and |0') and probing it 
after time Tp, the free precession time. Two radio-frequency 1/2-pulses 
separated by a period Tx of free evolution are applied to the qubit transition. 
The radio frequency is slightly detuned from resonance (near 10.265 MHz, 
corresponding to the |—1) <> |0’) and |+1) <> |0’) transitions) yielding 
Ramsey fringes with period 1/(144.4 Hz) between 0.1 and 30 ms of free 
precession time. For the measurement between 500 and 1,000 ms, the period is 
1/(8.069 Hz) owing to a change in the detuning of the radio-frequency field. 
n = 20 (times up to 30 ms) and n = 40 (times 500-1,000 ms); error bars, s.d. 
Experimental parameters are given in Supplementary Information. 


efficiently after completion of the STIRAP cycle. This forms the basis 
for the measurements of the lifetime of coherent superpositions in the 
protected subspace, which we now turn to. 

Our Ramsey-type experiments test the dephasing time of the 
dressed-state qubit. Figure 3b shows that coherence is preserved for 
more than 1,000 ms, close to the absolute limit, of about 1,700 ms, set 
by the lifetime of the state |D) and more than two orders of magnitude 
longer than the dephasing time of the atomic states |—1) and |+1), 
making this scheme ideal for realizing a quantum memory. 

Multiqubit gates can be realized by coupling the electronic qubit to 
the motion of the ions’®''”*. Several schemes for achieving such con- 
ditional quantum dynamics are possible using dressed states. Here we 
outline the scheme illustrated in Fig. 1b. We use a pair of radio- 
frequency fields on the |0’)<>|+1) transitions (or a single radio- 
frequency field, depending on the choice of the phase between the 
dressing fields; Methods). These radio-frequency fields are detuned 
by v from resonance and are thus in resonance with the first motional 
sideband (Fig. 1b). In the dressed-state basis, this couples the 
|B) © |0’) (one additional field) or |D) <> |0') (two additional fields) 
qubit resonance of the protected subspace to the vibrational mode of 
the ion string. The coupling of |D) © |0’) or |B) © |0’), as appropriate, 
at zeroth order in the Lamb-Dicke parameter (carrier transition) will 
be cancelled in the rotating-wave approximation owing to the higher 
energy (by an amount corresponding to Q) of |B) or |D), and |D) or, 
respectively, |B) will be coupled only at first order in the Lamb-Dicke 
parameter such that we obtain a Hamiltonian of the form (Methods) 
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H=V2nQ,(\D) (0'|e +h.c.)(b* —b) 


where 1 is the Lamb-Dicke parameter, b and b* are respectively the 
annihilation and creation operators of the trap harmonic oscillator 
(“denotes adjoint), and h.c. denotes Hermitian conjugate. 
Importantly, because this gate has no carrier part, a small 7 value 
can be compensated for by increasing the radio-frequency power while 
obeying 72, <v. This will allow moderate static?*’* or oscillating’ 
magnetic field gradients to be used when realizing multiqubit quantum 
gates with radio-frequency or microwave radiation. Also, this scheme 
permits the realization of other types of gate, including gates with ions 
in thermal motion’®”’. 

We now outline how the dressed-state scheme for QIP can be 
applied to a collection of trapped ions. The long wavelength of the 
microwave radiation requires the use of a static magnetic field gra- 
dient, if individual addressing in frequency space is desired. A gradient 
of a static or oscillating magnetic field is required for the generation of 
the coupling of the electronic degrees of freedom to the motional 
degrees of freedom when microwave or radio-frequency radiation is 
used for quantum gates. Therefore, this does not represent an addi- 
tional experimental requirement. 

In the absence of a static magnetic field gradient, a single pair of 
microwave-dressing fields is sufficient to dress all ions. In the presence 
of a static gradient, each ion would require its own pair of driving 
fields. This can be accomplished efficiently by using a microwave 
frequency comb where the frequency spacing coincides with the 
change in static Zeemann shift between neighbouring ions (Fig. 4). 
Thus, the dressed-state structure for each ion remains essentially the 
same as in the single-ion case resulting in robust memory. For both 
single-qubit gates and multiqubit gates, individual addressing is 
achieved by choosing the frequency of the radio-frequency field. 
Corrections are small under the condition that Qs, < AEz/h, where 
AE; is the difference in Zeeman shifts between two neighbouring ions 
and fi is Planck’s constant divided by 27. An important advantage over 
the standard quantum computing scheme is that the carrier frequency 
and the sideband can be chosen by interference and not by setting the 
frequency on resonance with one of these transitions; that is, the choice 
is set by the phase difference between the two radio-frequency driving 
fields (Methods). This allows for fast multiqubit gates that are not 
limited by Q, < v but by 7, < ¥; that is, such gates are possible even 
in the presence of a small (effective) Lamb-Dicke parameter. 

The insight gained in this work removes a major obstacle to laser- 
free QIP (but is also applicable to laser-based schemes). Furthermore, 
this scheme is not restricted to trapped ions and is in fact applicable to 


|0) |0) |0) 


+1) +1) 1) +1) 1) +1) 


Figure 4 | Multiqubit scheme for three ions exposed to a magnetic field 
gradient. The magnetic field at one of the two outer ions has the same 
magnitude but is opposite in direction to the field at the other outer ion, 
producing opposite, symmetrical Zeeman energy shifts of the levels |—1) and 
|1). The arrows represent three pairs of microwave-dressing fields (each pair 
indicated by a different colour) from a comb with a frequency interval 
corresponding to the Zeeman shift between neighbouring ions. For each off- 
resonance microwave field with detuning 4 acting on a particular ion, there is a 
second field with equal and opposite detuning, — 4. Hence, each such off- 
resonance pair will couple only to either |B) or |D) (depending on the relative 
phase between the microwave fields) leading to equal and opposite a.c. Zeeman 
shifts, which then cancel. 
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other physical systems in which dephasing due to external perturba- 
tions has a role, for instance neutral atoms”’ and solid-state systems 
such as nitrogen-vacancy centres in diamond”, ion-doped crystals”, 


and circuit quantum electrodynamics”. 


METHODS SUMMARY 

The experimental system. The atomic hyperfine states of the electronic ground 
state of the '”'Yb* ions used in this work can be mapped to the general scheme 
shown in Fig. 1 as follows: |F=0)<>|0), |F=1, mp=—1)|-1), |F=1, 
mp= +1)@|+1), |F= 1, mp= 0) 10’) (ref. 30). 

The effectiveness of the STIRAP process for creating dressed states and trans- 
ferring them into the final state has been investigated in detail as a function of the 
parameters characterizing the pulse sequence (Methods). 

Single-qubit gate. We refer to {|0’), |D)} as the D qubit and to {|0’), |B)} as the B 
qubit in what follows. The Hamiltonian reads: 


Hogg = ©0|0) (0| + 2o([1)(1| —| -1)(-1)) 
+Q(|—1) (Oe! +e” |1) (Ole! + hic.) 
+Q, cos (At)(| —1)(0'] +h.c.) 

+Qg cos (Apt +4)(|1)(0'| +h.c.) 


Here, ( is the zero-field hyperfine splitting, w+ are the microwave frequencies 
with their initial phase difference 0, Ao is the radio frequency, and ¢ is the initial 
phase difference between the radio frequency fields. For 0 = 0 and ¢ = 1 we obtain 
the D qubit, and for 0 = mand @ = 0 we obtain the B qubit. If we set w_—) = @o + Ao 
and @ = Wy — A, then by moving to the interaction picture with respect to the 
time-independent part we find in the rotating wave approximation (RWA), for the 
D qubit, that: 


Hog = V2Q({B) (0| +h.c.) + V22,(|D) {0"| +h.c.) 


Q Q / 
= Fa) ul ~ la) (a + V224((D)(0| +h.) 


The first two terms shift states |u) and |d) (which both contain |B)) away from |D), 
and the second part creates the gate. The interactions created by the two micro- 
wave fields with Rabi frequency 2 and the radio-frequency fields of frequency 2, 
(Fig. 1a) permit the implementation of this single-qubit gate. The two radio- 
frequency fields have a phase difference of 7 at the initial time of the pulse, which 
realizes the coupling to |D) and not to |B). 
Alternatively, we may work with the B qubit (Methods). 

Multiqubit gate. The levels and fields of Fig. 1b (for the D qubit) are described by 
the Hamiltonian 


Anngg = ®o|0) (0| + 4o({1) (1| -|—1)(—1) 

+Q(|—1) (Ole?! +e!” 11) (lew + hc.) 
+ Qg cos|(Ao — 5)E|(| — 1) (0'| +h.c.) 

+Q, cos|(Ao — 6)t + 0+ m]({1) (0'| +h.c.) 
+vb* bl1)(1|+A(|—1)(—1|—|1)(1)(b+b") 


where / is proportional to the magnetic gradient as in ref. 6, 6 is the detuning of the 
radio frequency field, 0 = 0 for the D qubit and 0 = x for the B qubit. 
In the interaction picture with respect to the microwaves in RWA, and after 
the polaron transformation U=ell+})(+NG* —be—nl-1)(-6* 8), the radio- 
frequency part becomes 

UHU* =Q,(| _ 1)(0'Je™" —b) gidt 


+e 1)(0'Jeno™ —b) idt +h.c.) 


To first order in the Lamb-Dicke parameter, for the D qubit we obtain the 
Hamiltonian 


V2 Qq(|D) (0' |e —h.c.)(b* —b) 
To zeroth order, we obtain a coupling between |B) and |0'); however, this term is 


ignored because the coupling, ,, is much smaller than the energy gap, which is of 
order Q. For the B qubit, we get the analogous Hamiltonian (Methods). 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


The experimental system. The atomic hyperfine states of the electronic ground 
state of the '”'Yb* ions used in this work can be mapped to the general scheme 
shown in Fig. 1 as follows: |F=0)<>|0), |F=1, mp 1o|-1), |F=1 
mp= +1)|4+1), |F= 1, mp=0) © |0’) (ref. 30). 

For STIRAP, two microwave fields with Gaussian amplitude envelopes shifted 
in time relative to each other are used to drive the |—1)<>|0) and |+1)<|0) 
transitions, respectively (Fig. 1). 

The effectiveness of the STIRAP process for creating dressed states and trans- 
ferring them into the final state has been investigated as a function of the para- 
meters characterizing the pulse sequence (see ref. 25 for optical STIRAP). The 
sequence is divided into discrete time increments At= 1/foN,, where N, is a 
positive integer, fg = Q,/2n and Q, is the peak Rabi frequency. The width of 
the Gaussian pulses is given by N/fo, where N is an integer. The STIRAP sequence 
is robust to variations of these parameters: we investigated the range 10 = N, = 40 
for N = 10 and found no variation in the effectiveness of the pulse sequence; that is, 
the overall fidelity of initial preparation of |0), preparation of |D) and read out stays 
constant (within statistical variations) at a value of about 93%. 

When measuring sequences with pulse widths varying in the range 2 = N = 20, 
equally good results were obtained for N = 4. The separation in time, s;/fo, of the 
two Gaussian pulses was varied over the range 0 = s;/fg = 40/fo, for a pulse width 
with N = 10 to obtain a plateau of high effectiveness (~93%) for values of s, in the 
interval 10 =s,=20. The best performance is obtained for equal detunings 
A,=A_, ideally 4; =0=A_, and for small relative detunings. For 
|6| =|44 — A_| <0.1Q, an experimental investigation yielded no statistically 
significant variation of the dressed-state preparation fidelity. 

To prepare, for example, the state |D), the population is first transferred from the 
initial state |0) to the atomic state | — 1) by applying a microwave 1-pulse. The first half 
of the STIRAP sequence then transfers the atomic population to the dressed state |D). 

A holding period in the evolution of the two STIRAP microwave fields is 
introduced at the crossing point of the amplitude envelopes of the two pulses. 
At that time, the occupation of |D) is at its maximum. Any dephasing of |D) or 
transitions to other states during the holding time, T, gives rise to imperfect 
population transfer during the second half of the STIRAP sequence that transfers 
the system to the atomic state | +1). 

Probing the state after the complete STIRAP sequence does not distinguish 
between |—1) and | +1), as both yield bright results on final detection of resonance 
fluorescence. Therefore, a 1-pulse swaps the populations of |+1) and |0) before 
final detection. Thus, a dark result indicates a successful STIRAP transfer between 
atomic states and dressed states, and a lifetime T of the dressed state prepared 
during the holding period (Fig. 2). 

To record Rabi oscillations between dressed states, we first prepare state |B). 
Then a radio-frequency pulse in resonance with |0’) <> |+1) (Fig. 1a) is applied 
during the holding time, T, in the evolution of STIRAP. This radio-frequency pulse 
induces Rabi oscillations between |B) and |0’). After the Rabi pulse, the STIRAP 
pulse sequence is completed and the population of the atomic state |() is probed as 
described above. If during the radio-frequency Rabi pulse state |0’) is populated, 
then the second part of the STIRAP sequence has no effect; that is, |0) is not 
populated at the end of the experimental sequence. 

Single-qubit gate. We refer to {|0’), |D)} as the D qubit and to {|0’), |B)} as the B 
qubit in what follows. The Hamiltonian reads: 


Hogg = @o|0) (0| + 4o((1)(1|—|—1)(— 1) 
+Q(|—1) (Ole?! +e |1) (Ole! + huc.) 
+Q, cos (Agt)(| —1)(0'| +h.c.) 

+ Q, cos (Agt + $)(1) (0'| +h.c.) 


Here, Wo is the zero-field hyperfine splitting, w+) are the microwave frequencies 
with their initial phase difference 0), Ay is the radio frequency, and ¢# the initial 
phase difference between the radio frequency fields. For 0 = 0 and ¢ = nm we obtain 
the D qubit, and for 0 = m and ¢ = 0 we obtain the B qubit. In the rotating-wave 
approximation we get 


Heqg = 000) (0| + Ao({1)(1| —| -1)(—1)) 


+Q(|—1) (Ole7?-* +e” |1) (Ole! + hc.) 


+.4(|—1)(0'fe** — el” |1) (0 |e”? +h.c.) 
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with 0 = 0 for the D qubit and 0 = zx for the B qubit. If we set o_; = @y + Ay and 
1 = Mo — Ao, then by moving to the interaction picture with respect to the time- 
independent part we find, for the D qubit, that 


Hogg = V2Q(|B) (0| +h.c.) + V2Q,(|D) (0'| +h.c.) 


= Flea l)(d| +V30,((0)(0| +h) 
The first two terms shift states |) and |d) (which both contain |B)) away from |D), 
and the second part creates the gate. The interactions created by the two micro- 
wave fields with Rabi frequency Q and the radio-frequency fields of frequency 2, 
(Fig. la) permit the implementation of this single-qubit gate. The two radio- 
frequency fields have a phase difference of 1 at the initial time of the pulse, which 
realizes the coupling to |D) and not to |B). 

Alternatively, we may work with the B qubit by choosing the same initial phase 
for the two radio-frequency fields; that is, one radio-frequency field is sufficient to 
couple to |B). We find that 


Big aa) V2Q(|D) (0| +h.c.)+ V2Q,((B) (o'| +hc) 


Q Q , 
Fa l*ul— Feld) d| +V20,(B)(0!| +h.) 


Multiqubit gate. The levels and fields of Fig. 1b (for the D qubit) are described by 
the Hamiltonian 


Fmgg = @o|0) (0| + Ao({1)(1| —|—1)(—1)) 

+Q(|—1) (Ole! +e |1) (Ole! + hc.) 
+Q, cos{(Ap — 5)t](|—1) (0'| +h.c.) 

+Q, cos[(Ag — 5)t + 0+ 1]({1) (0'| +h.c.) 
+vb* B1)(1|+A(|—1)(— 1] —|1) (1) +6") 


where / is proportional to the magnetic gradient as in ref. 6, 6 is the detuning of the 
radio frequency field, 0 =0 for the D qubit and 0 =n for the B qubit. In the 
rotating wave approximation, we get: 


Hmqg = o|0) (0| + Ao(]1) 1] —|-1)( 1) 

+Q(|—1) (Ole! +e” |1) (Ole + hc.) 

+ Q4(| =1)(0'fe M70) 4 ei 1) (Qe 40-9)" Lc.) 
+vb* b]1)(1| +A —1)(—1]-|1) (1p+b*) 


In the interaction picture with respect to the microwaves, and after the polaron 


transformation U=ell+})(+N@* —b)e—nl-1)(—1(b* 2), the radio-frequency part 
becomes 


UHUt =Q,(|—1)(0'|e7"" — Hel? 
+e"| 1)(0" |e" —b) idt +h.c.) 


To first order in the Lamb-Dicke parameter, for the D qubit we obtain the 
Hamiltonian 


V2 Qq(|D) (0' |e —h.c.)(b* —b) 


To zeroth order, we obtain a coupling between |B) and |0'); however, this term is 
ignored because the coupling, ,, is much smaller than the energy gap, which is of 
order Q. For the B qubit, we get the analogous Hamiltonian 


V2 Qq(|B) (0' |e —h.c.)(b* —b) 


When using a microwave frequency comb to dress many non-degenerate qubits, 
small errors due to higher-order effects may be introduced and the implementa- 
tion of gates may lead to off-resonance transitions and, hence, phase shifts in other 
ions. However, it is important to note that the size of this error is exactly the same 
as for magnetic-gradient-induced coupling®. This is due to the fact that the micro- 
wave fields almost completely decouple the {|u), |d)} subspace from the qubit 
subspace ({|D), |0’)}) as long as Q is much larger (typically an order of magnitude) 
than the radio-frequency coupling, 2,. 
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Perpendicular switching of a single ferromagnetic 
layer induced by in-plane current injection 


Ioan Mihai Miron!, Kevin Garello', Gilles Gaudin”, Pierre-Jean Zermatten”, Marius V. Costachel, Stephane Auffret?, 
Sébastien Bandiera’, Bernard Rodmacq?’, Alain Schuhl? & Pietro Gambardella’*+ 


Modern computing technology is based on writing, storing and 
retrieving information encoded as magnetic bits. Although the 
giant magnetoresistance effect has improved the electrical read 
out of memory elements, magnetic writing remains the object of 
major research efforts’. Despite several reports of methods to 
reverse the polarity of nanosized magnets by means of local electric 
fields** and currents**, the simple reversal of a high-coercivity, 
single-layer ferromagnet remains a challenge. Materials with large 
coercivity and perpendicular magnetic anisotropy represent the 
mainstay of data storage media, owing to their ability to retain a 
stable magnetization state over long periods of time and their 
amenability to miniaturization’. However, the same anisotropy 
properties that make a material attractive for storage also make it 
hard to write to*. Here we demonstrate switching of a perpendicu- 
larly magnetized cobalt dot driven by in-plane current injection at 
room temperature. Our device is composed of a thin cobalt layer 
with strong perpendicular anisotropy and Rashba interaction 
induced by asymmetric platinum and AIO, interface layers”’®. 
The effective switching field is orthogonal to the direction of the 
magnetization and to the Rashba field. The symmetry of the switch- 
ing field is consistent with the spin accumulation induced by the 
Rashba interaction and the spin-dependent mobility observed in 
non-magnetic semiconductors'’’?, as well as with the torque 
induced by the spin Hall effect in the platinum layer’*"*. Our mea- 
surements indicate that the switching efficiency increases with the 
magnetic anisotropy of the cobalt layer and the oxidation of the 
aluminium layer, which is uppermost, suggesting that the Rashba 
interaction has a key role in the reversal mechanism. To prove the 
potential of in-plane current switching for spintronic applications, 
we construct a reprogrammable magnetic switch that can be inte- 
grated into non-volatile memory and logic architectures. This 
device is simple, scalable and compatible with present-day magnetic 
recording technology. 

The coupling of spin and orbital angular momenta underlies the 
magnetic anisotropy properties of ferromagnets. Strong anisotropy 
allows for permanent, stable storage but also requires stronger mag- 
netic fields to write information to magnetic media. An ideal solution 
to this problem requires the spin-orbit interaction to be beneficial for 
both storage and writing purposes. Experiments on non-magnetic 
semiconductors have revived interest in effective magnetic fields ori- 
ginating in spin-orbit coupling’’’*~°. Such fields relate the spin of an 
electron to its momentum, converting a charge current into a source of 
spin polarization even in the absence of magnetism. More recently, 
spin-orbit fields have been predicted**” and observed in ferromagnets 
lacking structural inversion symmetry*’, where the s-d exchange 
interaction between charge carriers and localized d electrons mediates 
the effect of the spin-orbit fields on the magnetization. The possibility 
of generating strong spin-orbit fields in ferromagnetic metals (FMMs) 
is particularly interesting for applications, owing to the robust Curie 
temperature and perpendicular magnetic anisotropy (PMA) afforded 


by such systems. So far, however, the centrosymmetric lattice prop- 
erties of FMMs restrict this possibility to generation of the Rashba 
effect”, as inversion asymmetry can only be achieved by sandwiching 
an FMM layer between two dissimilar interfaces, one of which is 
usually a heavy, non-magnetic element”**. Owing to the Rashba inter- 
action, electrons flowing in the plane of a conductor with asymmetric 
interfaces experience an effective magnetic field Br = o%g(Z X j), where 
&p is the Rashba constant, z is a unit vector along the perpendicular 
axis (the z direction) andj is the vector current density. As the interface 
effects responsible for the Rashba interaction also induce PMA, Bg is 
orthogonal to the easy magnetization axis. Therefore, the Rashba field 
by itself cannot be used to control magnetization reversal in FMMs’. 

Here we demonstrate up-down magnetic switching of a cobalt dot 
characterized by strong PMA and Rashba interaction induced by an 
in-plane current. Bipolar reversal of the magnetization is achieved by 
injecting in-plane current pulses of either positive or negative sign 
parallel to a static magnetic field of moderate amplitude. This effect 
is discussed in terms of the symmetry of the current-induced spin- 
orbit fields acting on the cobalt layer. Switching occurs in single-layer 
FMMs independently of the domain configuration of the sample and 
without the need for a polarizer layer. We show that this phenomenon 
leads to innovative device architectures for spintronics. 

Each of our samples consists of a 500nm X 500nm cobalt dot 
sandwiched between a 3-nm-thick platinum layer (below) and a 1.6- 
nm-thick AlO, cap (above; Fig. 1a). The thickness of the cobalt layer is 
chosen to be 0.6nm to maximize the Rashba effect due to structure 
inversion asymmetry at the platinum and AIO, interfaces’. The AlO, 
and cobalt layers are each patterned into a square dot, whereas the 
platinum layer is etched into a cross-shaped structure for current 
injection and Hall voltage measurements. The perpendicular mag- 
netization of the dot, M,, is measured in terms of the anomalous Hall 
resistance, which varies by Ryan = 0.8 Q for opposite orientations of 
M,. A rotating sample stage is used to apply an external magnetic field, 
B, at polar and azimuthal angles 0 and ¢ (Fig. 1a). To study the effects of 
an electric current on M,, we apply 15-ns-long pulses of variable ampli- 
tude supplied by a voltage generator, using the circuit shown in Fig. 1b. 
All measurements are performed at room temperature. 

The central result of this work is shown in Fig. 1c, d, where we 
compare field-induced and current-induced reversal of M,. We first 
perform a standard magnetization measurement in the absence of 
current pulses, recording M, during a single sweep of B applied parallel 
to the current injection line (Fig. 1c, solid line). The field is intentionally 
tilted off-plane by 2° (0 = 92°, # =0°) to prevent the formation of 
magnetic domains. Magnetization reversal occurs at the coercive field 
|B,| = 300 mT, owing to the residual component, B,, parallel to the 
out-of-plane easy axis of the dot. We then repeat this magnetic field 
sweep, but stop at each value of B and inject a positive (set) current 
pulse and negative (reset) current pulse, measuring M, after each 
pulse. The black and red points (Fig. 1d) indicate the orientation of 
M, after the alternating injection of set and reset pulses of amplitude 


Catalan Institute of Nanotechnology (ICN-CIN2), E-08193 Barcelona, Spain. 2SPINTEC, UMR-8191, CEA/CNRS/UJF/GINP, INAC, F-38054 Grenoble, France. 3Departament de Fisica, Universitat Autonoma 
de Barcelona, E-08193 Barcelona, Spain. “Institucié Catalana de Recerca i Estudis Avangats (ICREA), E-08010 Barcelona, Spain. 


11 AUGUST 2011 | VOL 476 | NATURE | 189 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


Pitan (2) 


v- 03-02-01 0.0 0.1 02 03 


Hall 


Figure 1 | Device schematic and current-induced switching. a, Hall cross 
geometry. Black and white arrows indicate the ‘up’ and ‘down’ equilibrium 
magnetization states of the cobalt layer, respectively. b, Scanning electron 
micrograph of the sample and electric circuitry used in the measurements. V5, 
and Vj;, represent the two terminals for the Hall voltage measurements. c, Mz 


measured by the anomalous Hall resistance as a function of applied field, B. d, M, 


I, = 2.6 mA. This plot shows two striking effects, illustrated in Fig. le. 
First, black and red points that represent magnetizations of opposite 
sign indicate switching of M, from up to down and vice versa following 
every pulse of current. Second, the sense of switching reverses as B goes 
through zero. Positive and negative sweeps of B give identical results. 
We note that pulse-induced magnetization reversal occurs in the 
entire bistability range of the FMM delimited by B., down to 
B~5mTIT. This behaviour is independent of the domain configura- 
tion of the sample (Supplementary Information) and is remarkably 
different from that expected for known magnetic interactions. For 
example, the in-plane Rashba and Oersted fields, as well as Joule 


I | pera || ile 
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Figure 2 | Switching efficiency as a function of current amplitude. a, b, M, 
measured after injection of positive (black squares) and negative (red circles) 
current pulses of amplitude J, = 1.57 mA (a) and I, = 1.94 mA (b). Filled 
symbols indicate data recorded during a + B—> —B sweep and open symbols 
indicate data recorded during a —B—> +B sweep, as shown by the arrows. The 
solid and dashed lines in a represent M, as a function of field. c, Switching 
efficiency as a function of pulse amplitude and applied magnetic field. Region I: 


190 | NATURE | VOL 476 | 11 AUGUST 2011 


B (1) 
measured after the injection of positive (black squares) and negative (red circles) 
current pulses of amplitude I, = 2.58 mA. The data are reported during a single 
sweep of B, corresponding to the solid line in c. e, Schematic of the pulse sequence 
and magnetization measurements. In both cand d, Bis applied at 0 = 92°, parallel 
to the current direction (# = 0°). The 2° offset with respect to the ideal in-plane 
direction is used to define the residual component B, unambiguously. 


heating, would tend to favour a demagnetized state that is not 
compatible with the observed deterministic switching’, whereas 
precessional switching does not depend on the current direction”. 
Measurements performed over a range of currents offer further 
insight into the switching phenomenon (Fig. 2). Although weak current 
pulses have no effect, starting at around J, = 1.3mA (Fig. 2a) we 
observe a gradual reduction in the coercive field (Fig. 2c, red triangles). 
This behaviour can be understood by noting that the combination of a 
weak current-induced effect and the residual component B, assists 
magnetization reversal towards the equilibrium direction, parallel to 
B,. At I,~ 1.9 mA, however, we observe the onset of current-induced 


j (108 A cm) 
0.0 0.5 1.0 1.5 


conventional field-induced magnetization reversal occurs at B = B,. Region II: 
assisted reversal. Red triangles indicate the minimum external field required to 
reverse M, parallel to B,. Region III: pulse-induced switching. Black triangles 
indicate the minimum field at which positive current pulses reverse the 
magnetization antiparallel to B,. The maximum field at which switching is 
observed (blue open squares) coincides with the coercivity of the dot. 
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switching events (Fig. 2b), which become systematic as I, is further 
increased above 2.3mA (Fig. 1d). The high-field switching limit 
(Fig. 2c, blue squares) gradually extends up to B = B., indicating that 
the torque working against the action of B, becomes stronger as Ip 
increases. At the same time, the minimum in-plane field component 
necessary to trigger the reversal decreases to about 5 mT (Fig. 2c, region 
III). Furthermore, we find that the critical switching current scales with 
the lateral dimensions of the cobalt dot and the inverse pulse duration 
(Supplementary Information). 

A systematic study of current-induced magnetization reversal as a 
function of applied field direction (Fig. 3) shows that the switching 
torque is maximal when B is applied parallel to j and decreases away 
from this orientation following a sin(# — 2/2) function. Combining 
the above information, we observe that the action of the current on the 
magnetization is equivalent to that of an effective perpendicular mag- 
netic field Bs, ~ Bp X B ~ (Zz X j) X B, which changes sign on reversal 
of either B or j (Fig. 3c). In an FMM, however, relative to the internal 
exchange field produced by the local magnetization, the external field, 
B, has no direct influence on the spin polarization of the conduction 
electrons. Therefore, Bs, shall be considered the perpendicular com- 
ponent of an effective magnetic field proportional to Bp X M, where 
the orientation of M is initially determined by B. 

Although there is no theory predicting the magnetization reversal 
mechanism reported here, we observe that there are at least two effects 
that can induce spin accumulation parallel to Bg X M. We describe the 
first in analogy to phenomena observed in non-magnetic semi- 
conductors. Faraday measurements performed on InGaAs hetero- 
structures have shown that a magnetic field applied parallel to j 
induces a spin polarization component perpendicular to the plane of 
the current’. This phenomenon has been explained by the combined 
action of Bg and spin-dependent electron mobility, which leads to a 
net perpendicular spin accumulation proportional to the cross product 
Bg X B (ref. 12). We emphasize that this term has the same symmetry, 
current and field dependence as the effective switching field. This 


j>0,B>0 


Figure 3 | Dependence of switching on applied field direction. a, Diagrams 
representing M, reversal regions, showing the intensity and orientation of the 
applied magnetic field, B, at constant current (Ip = 3.3 mA). Colours indicate 
the switching sense: red, upwards; blue, downwards. Filled symbols show the 
coercive field at each angle. When B is aligned with the current (# = 0°), the 

switching areas extend throughout the entire bistability region delimited by B,, 
with the exception of a narrow area near zero field. As B is rotated away from 
the current, the sizes of the switching areas gradually decrease. When B is 
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analogy suggests that a similar mechanism might be at work in our 
case, considering that the spin-dependent conductivity required to 
explain the effect in InGaAs is a common feature of FMMs”. 
Following this argument (see Supplementary Information for more 
details), the accumulation of out-of-plane spin polarization in an 
FMM requires a non-zero in-plane magnetization component parallel 
to j, induced by a small external field. The final effect, mediated by s-d 
exchange, is equivalent to that of a downward or upward effective field, 
Bs,, parallel to the z component of Br X M, in agreement with our 
measurements. 

The second mechanism is based on the absorption of the spin 
current produced by the spin Hall effect (SHE) in platinum'*"* and 
diffusing into the cobalt layer. The absorption of this current, polarized 
along the y direction, is equivalent to a torque acting on the cobalt 
magnetization that has the same symmetry as the effect reported here. 
Estimates based on the spin Hall angle in platinum, however, show that 
the SHE is not sufficient to account for the intensity of the switching 
field (Supplementary Information). Moreover, we find that samples 
with stronger PMA anda larger degree of aluminium oxidation switch 
more easily than do magnetically softer dots with equally thick cobalt 
and platinum layers (Supplementary Figs 8 and 9). This observation is 
not compatible with a simple explanation based on the SHE. 
Nevertheless, we cannot exclude either a partial contribution by the 
SHE to switching or a more complex mechanism involving the SHE. 
For example, the vertical gradient of the Rashba field at the cobalt- 
platinum interface might induce a supplementary spin current that 
adds to the SHE in platinum. 

Although this work calls for a detailed theory of non-equilibrium 
spin-orbit phenomena in FMMs, it is important to distinguish two 
practical features of our results. First, when sufficient current and field 
are applied in the same direction, switching is robust and limited only 
by the intrinsic bistability of the FMM. Because neither the external 
field value nor its exact orientation is critical, a constant magnetic field 
produced by a macroscopic permanent magnet can be used to ensure 


j<0,B>0 j>0,B<0 


perpendicular to the current (¢ = 90°), switching disappears and is replaced by 
random nucleation. b, Azimuthal dependence of the switching efficiency at 
constant current (J, = 3.1 mA, 6 = 87°). Open symbols represent the largest 
field against which switching is observed; filled symbols show the coercive field. 
The dashed line is a sine function. c, Directions of the effective switching field, 
Bg, relative to j, the in-plane magnetization component (M,.) induced by B, and 
the Rashba field, Bp. The magnetization switches from ‘up’ to ‘down’ for j > 0, 
B>0Oandj<0,B<0, and from ‘down’ to ‘up’ forj > 0,B<0Oandj<0,B>0. 
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Figure 4 | Prototype of a reconfigurable ferromagnetic switch. a, Scanning 
electron micrograph of a device comprising two 50-nm-thick CoFe bar 
magnets parallel to the current injection line (bottom left and upper right 
corners) and an AlO,—cobalt-platinum dot (centre). b, Top two panels: 
switching curve of a device without CoFe magnets around the zero field region. 
M, is measured after injection of positive (black squares) and negative (red 
circles) pulses. No difference between positive and negative field sweeps is 
observed apart from random nucleation events near B = 0. Bottom two panels: 


bidirectional current switching of many independent magnetic bits. 
Second, the area separating the two opposite switching zones near 
B=0 is very narrow, meaning that very small applied fields suffice 
to induce reversal. This feature prompts an approach for applications. 
Small permanent magnets can be fabricated on-chip, close to or on top 
of each active dot. In this way, not only can the dots be switched, but 
their response to the current direction can also be inverted by reversing 
the polarization of the magnets. 

The ability to change the operation mode of a device is extremely 
valuable for programmable magnetic logic applications”””*. This 
motivated us to fabricate a small prototype switch by depositing two 
CoFe bars on top of the current injection lines near the cobalt dot 
(Fig. 4a). The shape anisotropy of the CoFe magnets ensures the align- 
ment between their stray field and current lines. Figure 4b shows the 
low-field region of a typical switching curve, comparing results for a 
device without magnets (the two upper panels) with those of a device 
with CoFe bars (the two bottom panels). In the first case, as B 
approaches zero, switching is replaced by random nucleation events. 
In the second case, owing to the additional stray field of the magnets, 
the switching curve shows hysteretic behaviour extending to the zero- 
field region (shaded area). We further tested zero-field switching using 
a complex sequence of current pulses (Fig. 4c). After the magnetization 
of the CoFe magnets has been saturated in one of two opposite direc- 
tions (Mcore > 0 and Mcore < 0), Mz reverses each time the pulses 
change polarity. As expected, reversing Mcore inverts the operation 
mode of the device. 

The ability to switch a single magnetic layer at room temperature using 
an in-plane current opens the way to a new generation of spintronic 
devices, combining planar geometry, highly stable perpendicular mag- 
netization, all-electrical write/read out schemes and reconfigurability. 
The layer structure of our prototype switch is extremely simple, is 
scalable and is based on materials compatible with present technology. 
The current density threshold for switching is of the order of 
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switching curve of a device including on-chip magnets. The shaded areas show 
that bipolar switching extends to zero external field. Positive and negative field 
sweeps reveal the hysteretic behaviour of the CoFe magnets as well as the 
inversion of the switching polarity near B = 0. c, Controlled switching sequence 
of the device in zero field after setting positive (green squares) and negative 
(orange circles) CoFe magnetization. The switching current is I, = 2.5 mA for 
pulses 9 ns long. a.u., arbitrary units. 


10° Acm ~, and is expected to improve with further interface engineer- 
ing. We note that, because the current is applied in-plane through a thin 
lateral surface, the absolute current required for switching can be made 
very small (<1 mA), providing significant gains in terms of integration 
and power consumption. Apart from its use in novel device architec- 
tures, this type of ferromagnetic switch also has interesting implications 
for existing technology. For example, a FMM oxide heterostructure 
could easily be integrated as the storage layer of a magnetic tunnel 
junction in magnetoresistive random-access memory, to decouple the 
read and write current paths. In this way, electrical stress of the tunnel 
barrier during the write process, which is an outstanding problem of the 
spin transfer approach’”*, could be avoided while maintaining a large 
tunnel magnetoresistance and read-out sensitivity. 


METHODS SUMMARY 


The samples were fabricated from aluminium (1.6nm), cobalt (0.6nm) and 
platinum (3 nm) layers deposited on a thermally oxidized silicon wafer by d.c. 
magnetron sputtering’. The deposition rates were 0.05nms ' (cobalt and 
aluminium) and 0.1nms (platinum) at an argon pressure of 2 X 10° mbar. 
After deposition, the samples were oxidized by a 35-s exposure to radio-frequency 
oxygen plasma at a pressure of 3X 10 * mbar and a radio-frequency power of 
10 W. This treatment ensures chemical stability of the layers and preserves the 
strong PMA typical of cobalt-platinum interfaces'*’’. The presence of two dis- 
similar interfaces, of which one is a heavy metal characterized by strong spin-orbit 
coupling, gives rise to an electric potential gradient across the cobalt layer. This 
type of structure inversion asymmetry induces a Rashba-type magnetic field 
parallel to the y direction when the current flows along the x direction’”*. The 
AlO,—cobalt—platinum films were patterned by electron beam lithography and ion 
beam etching into 500 nm X 500 nm AlO,—cobalt dots and platinum Hall crosses 
(Methods). This geometry allows the magnetic state of the dots to be monitored by 
electric measurements using the anomalous Hall effect. If an electric current is 
applied along one arm of the cross, the Hall resistance, Ry, measured along the 
transverse arm, is proportional to the perpendicular component of the mag- 
netization. Representative magnetization curves of a 500nm x 500nm device 
are shown in Supplementary Fig. 1. Smaller devices, 200 nm X 200 nm in size, 
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were fabricated to probe the scaling of the critical switching current with device 
size (Supplementary Fig. 5). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


The samples were fabricated from aluminium (1.6nm), cobalt (0.6nm) and 
platinum (3 nm) layers deposited on a thermally oxidized silicon wafer by d.c. 
magnetron sputtering’. The deposition rates were 0.0Snms_' (cobalt and 
aluminium) and 0.1nms ' (platinum) at an argon pressure of 2 10° mbar. 
After deposition, the samples were oxidized by exposure to radio-frequency 
oxygen plasma at a pressure of 3X 10 * mbar and a radio-frequency power of 
10 W for 35s. This treatment ensures chemical stability of the layers and preserves 
the strong PMA typical of cobalt-platinum interfaces'®”. The presence of two 
dissimilar interfaces, of which one is a heavy metal characterized by strong spin- 
orbit coupling, gives rise to an electric potential gradient across the cobalt layer. 
This type of structure inversion asymmetry induces a Rashba-type magnetic field 
parallel to the y direction when the current flows along the x direction’”®. 

Patterning. Square magnetic dots were patterned on top of the Hall cross by using 
two steps of electron beam lithography followed by ion beam milling and wet 
etching. During the first step, the current injection line is drawn onto the PAMA 
resist covering the AlO,—cobalt-platinum layer. After development, 15nm of 
titanium are deposited by electron beam evaporation. The lift-off procedure finally 
defines a 15-nm-thick titanium line. A second step of electron beam lithography 
(similar to the first one) is used to draw the transverse Hall branches perpendicular 
to the current injection line. Subsequently, we repeat the resist development, 
titanium evaporation and lift-off procedures. The device is now composed of 
two perpendicular, 15-nm-thick titanium lines that intersect. At their intersection, 


the total titanium thickness is 30 nm, the sum of the two depositions. Ion beam 
etching controlled by secondary ion mass spectroscopy is used to etch away the 
equivalent of 15 nm of titanium. At this stage, the AlO,—cobalt-platinum layer is 
trimmed into a cross as the titanium mask has been removed everywhere except at 
the intersection of the two lines, where 15 nm of Ti remain unetched. We then used 
a chemically selective wet etching step to remove the oxide and the cobalt layers 
that are not covered by titanium. At the end of this process, the sample is com- 
posed of a platinum cross with a magnetic dot on top. The success of the fabrica- 
tion technique is monitored by measurements of the anomalous Hall effect: after 
the chemical etching, the Hall resistance is reduced to approximately 70% of the 
initial value. The total device resistance was typically 3-5 kQ. However, we note 
that most of the resistance drop occurs in the platinum current lines, which are 
3 nm thick over a length of approximately 500 pm. 

Electrical measurements. The current pulses were generated by a 100-V/2-A pulse 
generator (Agilent 8114A/001). The rise and fall times of the pulses were approxi- 
mately 6 ns, as measured by a fast oscilloscope (Agilent DSO81304A) connected in 
series with the sample. The length of the current pulses was varied between 10 and 
100 ns. We note that the pulse length was always longer than the characteristic time- 
scale for magnetization dynamics (<1 ns) and that its effect is therefore analogous to 
that of a continuous current. To preserve the shape of the voltage pulse transmitted 
along the resistive platinum leads, we connected a 100-Q resistance in parallel with the 
sample. Two 100-kQ resistors were placed between the Hall branches and the data 
acquisition card (Fig. 1) to prevent current flow through the Hall contacts. 
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Reduced methane growth rate explained by 
decreased Northern Hemisphere microbial sources 


Fuu Ming Kai't, Stanley C. Tyler'}, James T. Randerson! & Donald R. Blake” 


Atmospheric methane (CH,) increased through much of the twentieth 
century, but this trend gradually weakened until a stable state was 
temporarily reached around the turn of the millennium’”, after 
which levels increased once more’. The reasons for the slowdown 
are incompletely understood, with past work identifying changes in 
fossil fuel, wetland and agricultural sources and hydroxyl (OH) sinks 
as important causal factors'* *. Here we show that the late-twentieth- 
century changes in the CH, growth rates are best explained by 
reduced microbial sources in the Northern Hemisphere. Our results, 
based on synchronous time series of atmospheric CH, mixing and 
13¢/°C ratios and a two-box atmospheric model, indicate that the 
evolution of the mixing ratio requires no significant change in 
Southern Hemisphere sources between 1984 and 2005. Observed 
changes in the interhemispheric difference of ‘°C effectively exclude 
reduced fossil fuel emissions as the primary cause of the slowdown. 
The '°C observations are consistent with long-term reductions in 
agricultural emissions or another microbial source within the 
Northern Hemisphere. Approximately half (51+ 18%) of the 
decrease in Northern Hemisphere CH, emissions can be explained 
by reduced emissions from rice agriculture in Asia over the past three 
decades associated with increases in fertilizer application’ and reduc- 
tions in water use’®"’. 


Several mechanisms have been proposed to explain the long-term 
slowdown of atmospheric CH, including decreases in source emis- 
sions***', changes in sink processes’, and a stabilization of CH, with 
relatively constant global emissions’. Using a three-dimensional atmo- 
spheric model, Bousquet et al.° provide evidence from a Bayesian 
inversion that the CH, slowdown in the 1990s was caused partly by 
decreases in fossil emissions in the Northern Hemisphere. Carbon and 
hydrogen isotope measurements have become an important means of 
determining the magnitude of sources and sinks of atmospheric CH4 
(ref. 13), including assessments of interannual- to centennial-scale 
trends in the global budget*'*"®. Here we analysed isotope and mixing 
ratio measurements from the University of California, Irvine (UCT), 
National Institute of Water and Atmospheric Research (NIWA) and 
the University of Washington, Stable Isotope Laboratory (SIL) net- 
works*'*"'7(Supplementary Table 1). Our goal was to quantify inter- 
hemispheric differences (IHDs) of CH, 8'°C-CH, and 5D-CH,, and 
to assess how they constrain recent changes in the global CH, budget. 

From the mixing ratio measurements, we could see that CH, growth 
rates in both hemispheres declined during 1979-2005 (Supplemen- 
tary Fig. 1). The IHD of CH, (defined as Northern Hemisphere CH, 
minus Southern Hemisphere CH,) started at approximately 80 + 20 
parts per billion (p.p.b.) during 1979-1981, and gradually increased to 
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Figure 1 | Long-term trends in atmospheric CH,, 5'°C-CH,, and 5D-CH,. 
The left panels show CH,; the middle panels show 5'°C-CHy, and the right 
panels show 6D-CHy. a-c, Original measurements used to construct the 
Northern Hemisphere and Southern Hemisphere time series. The observing 
stations are Niwot Ridge (black circles), Montana de Oro (red triangles), the 
Pacific Ocean cruise in the Northern Hemisphere (POCNH; magenta pluses), 
the UCI network 40° N (green asterisks), the SIL-POCNH (magenta squares), 
Baring Head (blue crosses), Scott Base (yellow asterisks), the Pacific Ocean 
cruise in the Southern Hemisphere (POCSH; cyan squares), the UCI network 
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40° S (green triangles), and the SIL-POCSH (cyan triangles). d—f, Twelve- 
month smoothed trends in the Northern Hemisphere (red line) and Southern 
Hemisphere (blue line). The smoothed trends were constructed after removing 
an annual cycle and applying an offset to each station based on hemisphere- 
specific reference sites. g-i, IHDs (green lines) were defined as the difference 
between the Northern Hemisphere and Southern Hemisphere smoothed time 
series in d to f. Error bars are one standard deviation (s.d.), on the basis of all the 
adjusted observations in a 12-month window centred at each monthly time step. 
For 5D-CH4, the error bars are one s.d. of the Pacific voyage measurements. 
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93+ 11p.p.b. during 1991-1995 (Fig. 1). The IHD subsequently 
decreased from 1995 to 2002, reaching a mean of 79 + 10 p.p.b. during 
2001-2005. For 8'°C, the Southern Hemisphere measurements indi- 
cated no significant long-term trend, with the mean during 1989-1993 
(—47.16 + 0.09%; mean + standard deviation, s.d.) almost the same 
as the mean during 2001-2005 (—47.09 + 0.01%o). In contrast, 82Cin 
the Northern Hemisphere increased from —47.40+0.07%o0 to 
—47.19 + 0.03% (Fig. le). Asa result, the IHD of 8'°C narrowed from 
—0.24 + 0.11%o0 to —0.10 + 0.04% over this time period (Fig. 1h). 
Considered separately from other constraints, the trend towards rela- 
tively enriched carbon isotope values in the Northern Hemisphere 
implied (1) a decrease in sources with isotopically depleted signatures 
(for example, agriculture, landfills or wetlands), (2) an increase in 
sources that were isotopically enriched (for example, Northern 
Hemisphere fossil fuel emissions or biomass burning), or (3) an 
increase in atmospheric removal by reaction with hydroxyl radical 
(OH) in the Northern Hemisphere. 

We also found that 5D-CHy, did not change substantially in the 
Northern Hemisphere during 1989-2005 (Fig. 1f). Northern 
Hemisphere 5D remained nearly at —94 + 1% between 1998 and 
2005. The IHD of D-CHy, ranged from — 12.5 to —2.5%o during the 
cruises made by the SIL network between 1989 and 1996 (Fig. li). 
Because of the sensitivity of 3D-CH, to sink processes’®, the relatively 
constant 6D-CH, suggested that there was not a substantial long-term 
change in the OH sink between 1998 and 2005. Other factors poten- 
tially influence D-CHy, however, and so to estimate quantitatively the 
impacts of possible changes in the OH sink’, we conducted several 
sensitivity simulations (Supplementary Table 2). 

To assess the implications of the measurements described above and 
to examine recent competitive hypotheses related to the levelling off of 
CH, mixing ratios, we constructed global CH, budget scenarios using 
a two-box atmospheric model’* (see Supplementary Methods). In a 
first step, we constructed a control simulation (scenario 1) that 
included natural and anthropogenic CH, sources (Supplementary 
Fig. 2). In this simulation, anthropogenic emissions associated with 
fossil fuels and agriculture—two of the largest anthropogenic terms— 
remained constant after 1980 (Supplementary Fig. 2a and b). We 
compared model estimates of CH, mixing and isotope ratio with 
observations, which were extended back in time with firn air and ice 
core measurements of CH, (ref. 19) and 8'°C-CHy, (ref. 20). The 
control captured the long-term trends in CH, and most of the varia- 
tions in the CH, IHD (Supplementary Fig. 2c and d). Considering CH, 
changes alone, these results appear to suggest that the long-term CH, 
slowdown can be explained by the levelling off of both fossil and 
agricultural emissions. However, differences between the model esti- 
mates and observations of 8'*C-CH, were substantial. For example, 
the model IHD of 5'°C-CH, remained approximately the same during 
1989-2005, whereas the observations showed an increasing trend (a 
lessening of the IHD (Supplementary Fig. 2f). The differences between 
the observed and modelled IHD in 8'*C-CH, implied that agricultural 
and fossil emissions may have had diverging trajectories and provided 
motivation for exploring the different emissions scenarios described 
below. 

In past work, decreases in fossil fuel emissions have been implicated 
as an important driver of the methane slowdown**"’. To assess this 
hypothesis using our isotope data, we constructed a second scenario 
assuming that fossil fuel emissions were solely responsible for the 
observed CH, trends and assuming that all the other sources were 
the same as in the control. First, we estimated the total flux of CH, in 
the two hemispheres using the smoothed observations (Fig. 1d) and a 
simple mass-balance inversion during 1984-2005. Figure 2a shows that 
the Northern Hemisphere flux decreased from 417 + 10TgCHyyr ' 
during 1984-1985 to 386 +5TgCHyyr | during 2004-2005. In par- 
allel, the Southern Hemisphere flux showed no significant change, 
starting at 148+5 and ending at 150+5TgCHy,yr | during this 
period. 
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Figure 2 | Variations in CH, fluxes and the impacts of source composition 


on isotopic trends. a, The CH, fluxes in both hemispheres were obtained from 
a mass-balance inversion using the mixing ratio observations shown in Fig. 1d. 
b, Comparison of the observed IHD of 8°C with model scenarios during 1989- 
2005. The simulation with decreases in fossil fuel emissions (scenario 2) did not 
capture the trend in the observed IHD of 5'°C: the 8!°C IHD in this scenario 
widened over 1989-2005. In contrast, the simulation with decreases in 
agricultural emissions (scenario 3) had an increasing trend in the IHD of 8'°C, 
similar to the observations. A scenario that included decreases in fossil fuels and 
other sources from ref. 5 (scenario 4) had a negative slope that fell between the 
control and fossil fuel scenarios. Additional sensitivity simulations with 
changes in sink processes (scenario 7 as described in the Supplementary 
Information) show the maximum potential for OH variability to contribute to 
the observed trend. For the CH, fluxes, error bars are one s.d. estimates of the 
fluxes derived from our inversion in a 12-month window centred at each 
monthly time step. For comparison, error bars are one s.d. estimates of the 
observations and model predictions in a 12-month window centred at each 
monthly time step. The standard deviations of the observations are the same as 
shown in Fig. 1. For the model predictions, the standard deviations take into 
account variations in model parameters, including the interhemispheric 
exchange time, the isotopic fractionation, and the isotopic composition of the 
sources. The slopes of the regression lines and standard deviation estimates are 
provided in Supplementary Tables 2 and 4. 


For this simulation (scenario 2), we assumed that all of the adjust- 
ments to the fluxes in both hemispheres (shown in Fig. 2a) were attri- 
buted to changes in fossil fuel emissions—with an isotopic signature 
specific to this source (Supplementary Table 3). The only difference 
between this scenario and the control run was the inventory of fossil fuel 
emissions (Supplementary Figs 2 and 3). As expected, this simulation 
reproduced most of the CH, mixing ratio observations (Supplementary 
Fig. 3c and d), with decreases in fossil fuel emissions after 1990 (Sup- 
plementary Fig. 3a). The decreases in fossil fuel, however, failed to meet 
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the constraints offered by the measurements of 8'°C-CH, (Supplemen- 
tary Fig. 3e). Importantly, the 5'°C IHD in this scenario widened during 
1989-2005 (Fig. 2b and Supplementary Fig. 3f). This contrasts with the 
observed 8'°C IHD that narrowed during the same time period (Fig. 2b 
and Supplementary Table 2). In this context, the 5'°C IHD observations 
did not support the hypothesis that decreases in fossil fuel emissions 
were the main contributor to the CH, slowdown during 1989-2005. 

Agriculture, including animal” and rice* emissions components, is 
a large term in the contemporary budget® and one that has been subject 
to considerable modification over the past four decades**”*. To assess 
the role of agriculture, we created another scenario (scenario 3) in 
which we assigned the isotope ratio of the Northern Hemisphere 
and Southern Hemisphere fluxes needed to match the CH, observa- 
tions solely to agricultural sources (Supplementary Fig. 4a and b) 
following the same approach as described above for the fossil fuel 
scenario. As expected, the agricultural scenario also matched the mix- 
ing ratio observations (Supplementary Fig. 4c and d). In contrast with 
the fossil fuel scenario, however, the 54°C IHD from this run had a 
positive slope that captured most of the observed trend during 1989- 
2005 (Fig. 2b and Supplementary Fig. 4f). 

We also constructed additional sensitivity and Monte Carlo simula- 
tions with our box model examining uncertainties associated with the 
OH sink, the north-south interhemispheric exchange time for atmo- 
spheric mixing, and the isotopic composition of different source terms 
(as well as potential trends in the isotopic signature of these sources) 
(Supplementary Tables 2 and 4). Together, these scenarios suggested 
that our simulation results were robust with respect to our choice of 
model parameters. Other key uncertainties in our analysis were asso- 
ciated with the representativeness of our reference stations in the 
Northern and Southern hemispheres, the large variations in CH, 
and 5'°C-CHy, observed at the beginning of our time series, and the 
challenges involved in combining stable isotope observations from 
several laboratories into one long-term time series (comparisons with 
other available mixing ratio and isotope observations are provided in 
the Supplementary Information). The mechanisms causing the large 
variations in the global methane cycle during the early 1990s have not 
been fully resolved’® and may influence the long-term trend in the 
5'°C-CHy, IHD reported here. 

What possible explanations are there for the long-term reductions in 
microbial sources within the Northern Hemisphere required by the 
atmospheric isotope observations? Important microbial sources 
include wetlands, ruminant animals, landfills and rice agriculture. We 
summarize below evidence that decreasing emissions from rice agricul- 
ture probably accounted for some of the observed source reductions in 
the Northern Hemisphere, using a biogeochemical model to synthesize 
information from country-level agricultural statistics and recent field 
studies. Improved management of landfills also may have contributed 
to Northern Hemisphere source reductions”. For ruminant animals 
and wetlands, however, key processes regulating these sources have 
not changed in a way that is consistent with the long-term decreases 
in emissions needed to explain the atmospheric observations**~*. 

Reductions in rice agriculture emissions have occurred primarily 
from land use intensification rather than changes in the area of rice 
production. Statistics compiled by the International Rice Research 
Institute”® show that the world’s rough rice (that is, unprocessed paddy 
rice) areas did not change by a substantial amount between 1980 and 
2005 (Fig. 3a). Considering rice production areas alone, trends over the 
past several decades are consistent with a stabilization of fluxes, but not 
the overall reduction needed to explain the CH, mixing ratio and 
isotope observations presented here. 

Accumulating evidence suggests, however, that application of 
chemical fertilizer? and more efficient water use’” substantially reduces 
CH, emissions per unit of area of rice production. CH, fluxes in rice 
fields are strongly regulated by levels of organic fertilizer input’. 
Chemical fertilizer application in China and India increased substan- 
tially from 1970 to 2005 (Fig. 3c). The rapid rise in chemical fertilizer 
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Figure 3 | Evidence for intensification of rice agriculture in Asia. The total 
world area under rice production has not substantially changed since the late 
1970s and is dominated by production in Asian countries (a). The two largest 
rice-producing countries during 1960-2005 were China and India (b). The use 
of chemical fertilizer in both countries, especially China, increased substantially 
during the same time period (c). The increase in rice yield (d) was consistent 
with rapid increases in chemical fertilizer application and implied a weakening 
dependence on organic fertilizer, which is both labour-intensive and a primary 
substrate for CH, production. Growing industrial water demand after the 1980s 
reduced the percentage of water withdrawals available for agriculture in China 
(e) and may be one factor contributing to new mid-season drainage practices. 
After the early 1980s, decreases in global CH, emissions from rice agriculture 
were predicted to occur on the basis of an analysis from a biogeochemical 
process model driven by climate, rice area, fertilizer and water management 
information (f). 


use is consistent with concurrent increases in rice yield (Fig. 3d) and 
reductions in organic amendments that are more labour-intensive’. 
Other studies have shown that changing the water management of rice 
paddies can reduce CH, emissions by about 10-80% (refs 10, 11 and 
27). At the country level, increasing urban and industrial demands for 
water have increased since 1980 and have limited the water available 
for agriculture’® (Fig. 3e). 

To assess the magnitude of emissions changes associated with rice 
agriculture during 1960-2005, we modified an empirically based bio- 
geochemical process model’’”® to include fertilizer application and 
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water management impacts on rice yield and methane fluxes (see 
Supplementary Information). Using this model, which is driven by 
climate, production area statistics, fertilizer inputs and management 
information, we estimated that the Northern Hemisphere rice emis- 
sions reached a maximum in the early 1980s and decreased by about 
15.5+1.9Tgyr ' between 1984 and 2005 (Fig. 3f). The decreasing 
trend in emissions explained approximately half (51 + 18%) of the 
observed trend inferred from the atmospheric CH, measurements 
(Fig. 2a). 

Here we found that the increasing enrichment of 5'°C in the 
Northern Hemisphere and a narrowing of the 5'°C IHD provided 
new information about the causes of the CH, slowdown between 
1989 and 2005. Model simulations indicated that decreases in 
Northern Hemisphere microbial sources had an isotopic signature con- 
sistent with the observed atmospheric changes. Although changes in 
atmospheric OH levels and fossil fuel emissions probably contributed to 
some of the variability during this period®”*, the trends in the 8'°C IHD 
presented here excluded these changes as the most important contri- 
butors to the long-term CH, slowdown. Changes in management of rice 
agriculture probably contributed to some of the observed decline in 
Northern Hemisphere emissions, on the basis of estimates from a bio- 
geochemical model. Evaluating the net impact of these management 
changes on climate requires careful consideration of other greenhouse 
gas fluxes, including production of N2O (see, for example, ref. 11). The 
future trajectory of CH, remains difficult to predict, with recent 
increases after 2006 potentially attributable to emissions from 
Northern Hemisphere wetlands” and biomass burning in the tropics’. 
Our work demonstrates the importance of considering changing agri- 
cultural practices in addition to climate effects on natural sources as key 
drivers of future atmospheric CH, levels. 


METHODS SUMMARY 


To quantify the IHDs of CH, 8'°C-CH, and 8D-CHu, we constructed a long-term 
time series in each hemisphere (at 40° N and 41° S) by using measurements from 
the UCI, NIWA and SIL networks*'*'”. These data were selected because they 
possessed multi-year 5'°C and 8D measurements of atmospheric CH, required in 
our analysis for constraining CH, source and sink processes. We then used a two- 
box atmospheric model'* that included information about CH, source and sink 
processes (and their impacts on isotopic fractionation) to examine various hypo- 
theses formulated to explain the overall decline in the CH, growth rate. We forced 
the model using emissions inventories (see, for example, ref. 22) for anthropogenic 
emissions from 1700 to 1980, and in our control simulation these emissions were 
held constant thereafter (through 2005). The model included changes in soil and 
animal CH, source isotopic composition over the past two centuries induced by 
changes in 5'°C-CO,. Finally, to evaluate changes in rice agricultural sources over 
the past few decades, we used an empirical process-based biogeochemical model 
with parameterizations derived from field measurements””°. We forced the bio- 
geochemical model with time series of rough rice area and fertilizer application 
rates using data from the International Rice Research Institute’’. Details of the 
sampling and measuring procedure, comparisons of our measurements with 
observations from other networks, analysis methods, and modelling approaches 
are described in the Supplementary Information. The main constraints offered by 
the observed mixing and isotopic ratios described above are summarized in a 
schematic figure (Supplementary Fig. 5). 
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Methane and ethane are the most abundant hydrocarbons in the 
atmosphere and they affect both atmospheric chemistry and climate. 
Both gases are emitted from fossil fuels and biomass burning, 
whereas methane (CH,) alone has large sources from wetlands, agri- 
culture, landfills and waste water. Here we use measurements in firn 
(perennial snowpack) air from Greenland and Antarctica to recon- 
struct the atmospheric variability of ethane (C,H,) during the 
twentieth century. Ethane levels rose from early in the century until 
the 1980s, when the trend reversed, with a period of decline over the 
next 20 years. We find that this variability was primarily driven by 
changes in ethane emissions from fossil fuels; these emissions peaked 
in the 1960s and 1970s at 14-16 teragrams per year (1 Tg = 10’ g) 
and dropped to 8-10 Tg yr‘ by the turn of the century. The reduc- 
tion in fossil-fuel sources is probably related to changes in light 
hydrocarbon emissions associated with petroleum production and 
use. The ethane-based fossil-fuel emission history is strikingly dif- 
ferent from bottom-up estimates of methane emissions from fossil- 
fuel use”, and implies that the fossil-fuel source of methane started 
to decline in the 1980s and probably caused the late twentieth 
century slow-down in the growth rate of atmospheric methane** 

Ethane is an organic trace gas that is primarily emitted to the atmo- 
sphere during mining, processing, transport and consumption of fossil 
fuels, during use of biofuels, and during biomass burning*®. It acts as a 
precursor of ozone and carbon monoxide in the troposphere. Ethane is 
oxidized rapidly by the hydroxyl radical (OH”) and has a seasonally 
varying lifetime (annual mean ~2 months). The short lifetime, coupled 
with a north-south asymmetry in its sources, leads to geographic and 
temporal variability in its atmospheric abundance*’. Ethane abund- 
ance displays strong seasonal variability, with peak-to-peak amplitudes 
comparable to annual-mean levels. Annual-mean ethane levels are 
highest at high northern latitudes (HNL, that is, 30-90° N; 1.5 parts 
per billion) and lowest at high southern latitudes (HSL, that is, 30-90° S; 
250 parts per trillion)*. 

We measured ethane in firn air collected at Summit (in Greenland), 
at the West Antarctic Ice Sheet — Divide, WAIS-D (in Antarctica), and 
at the South Pole. A synthesis inversion method was used to develop 
atmospheric histories of ethane for each site, using a one-dimensional 
firn air diffusion model (Fig. 1) (see Methods Summary). The mean 
temperatures and the ice accumulation rates are very similar at 
Summit (—31 °C, 20cm yr ') and at WAIS-D (—31 °C, 22cm yr), 
resulting in similar gas age distributions in the firn and atmospheric 
histories constrained for the past 50-60 years (refs 8, 9). South Pole is 
considerably colder (—51°C), with a lower ice accumulation rate 
(~8cmyr_') and a deeper firn column. As a result, atmospheric 
histories based on South Pole firn air measurements are constrained 
for the past 80-90 years (ref. 10). The ethane atmospheric histories 
based on the WAIS-D and South Pole firn air measurements display 
the same trends for the period since 1950 (Fig. 1). 

Mean annual ethane levels measured over Summit and South Pole 
are consistent with those at other high-latitude sites, indicating that the 
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Figure 1 | Ethane mixing ratios in firn air at three sites, and the atmospheric 
histories derived from these measurements. a-f, Results for Summit 

(a, b), South Pole (c, d) and WAIS-D (e, f). Filled and open squares in a, c and 
e show measurements with estimated uncertainties (error bars, +2 s.e.) 
(Supplementary Data). Solid lines in a, c and e show modelled firn profiles for 
the three respective sites, with the five different atmospheric histories (solid 
lines) shown in b, d and f. These five atmospheric histories were obtained by 
inverse modelling of the measurements with five different boundary conditions 
at f, that are identified by different colours, each representing a different pre- 
industrial mixing ratio for ethane. The data points in the top 35 m (Summit and 
South Pole) and 40 m (WAIS-D) of the firn are subject to the effects of seasonal 
variations in surface ethane levels (Supplementary Information) and are 
ignored during inverse modelling (open squares in a, c and e). The inversions 
were forced with contemporary annual-mean surface mixing ratios of 

1,460 p.p.t. at Summit and 215 p.p.t. at South Pole and WAIS-D (red triangles 
in a, c and e) (Supplementary Information). The South Pole atmospheric 
history is overlaid on the independently derived WAIS-D atmospheric histories 
for comparison (dashed line in f). The inversions are sensitive to assumptions 
about the pre-industrial ethane levels before 1950, 1940 and 1910 at Summit, 
WAIS-D and South Pole, respectively (shaded areas), implying that only the 
inversion results for the later years are valid atmospheric histories. 
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measurements in polar firn air provide a reasonable sampling of the 
ethane mixing ratios in high latitudes (see Supplementary Informa- 
tion). Together, the three atmospheric histories show ethane levels 
peaking in the HNL and HSL atmospheres in the early 1980s, followed 
by a decline over the 20 years that followed. The South Pole site con- 
tains sufficiently old air to constrain the ramp-up period during the 
early to mid twentieth century (Fig. 1). On the basis of these atmo- 
spheric histories, HSL annual-mean ethane levels increased fivefold 
between 1910 and 1980 (from about 60p.p.t. to 280p.p.t.), and 
declined more than 10% (to less than 250p.p.t.) by 2000. In the 
HNL, ethane increased from ~1.7p.p.b. in 1950 to ~2.0p.p.b. in 
1980, and declined 25% to ~1.5 p.p.b. in 2000. 

To infer the causes of the atmospheric ethane variations in terms of 
changes in large-scale sources, we must relate high-latitude ethane 
levels to hemispheric averages. We used the modern atmospheric 
ethane distribution*® and sensitivity tests with the UCI Chemical 
Transport Model (UCI-CTM)'*”” to derive ratios relating the response 
of the HNL and HSL to changes in hemispheric mean ethane levels (see 
Supplementary Information). Next, we used a simple two-box model 
(see Methods Summary) representing the troposphere in the Northern 
and Southern hemispheres to simulate variations in mean atmospheric 
ethane levels over the past century resulting from various scenarios for 
fossil-fuel, biomass-burning, and biofuel emissions. Fossil-fuel and 
biofuel emissions are concentrated in the northern mid-latitudes, 
and biomass burning in the tropics. As a result, these sources con- 
tribute to hemispheric ethane levels with different efficiencies (see 
Methods Summary and Supplementary Information). 

The two-box model was used to infer the fossil-fuel ethane emission 
histories needed to achieve agreement with the firn-air-derived atmo- 
spheric ethane histories from Summit and South Pole, under various 
assumptions about biomass-burning emissions (Fig. 2, see Supplemen- 
tary Information). The results show clearly that ethane variability over 
decadal timescales during the twentieth century was dominated by 
changes in the fossil-fuel ethane source. When biomass-burning and 
fossil-fuel ethane emissions are both allowed to vary after 1950, the 
model yields a biomass-burning source of less than 1 Tgyr * in 1950, 
rising to ~3Tgyr | by 2000, followed by a drop to 2Tgyr ‘. The 
model results show a large change in fossil-fuel ethane emissions in 
the first half of the twentieth century, peaking at 14-16 Tgyr ' in the 
late 1950s. Fossil-fuel emissions are then constant for about 20 years, 
followed by a 45% drop over the next 30 years. The model yields 
8-10Tgyr ' for fossil-fuel ethane emissions at the end of the twentieth 
century, consistent with the modern ethane budget inferred from atmo- 
spheric ethane levels®*’’. 

Fossil-fuel emissions are also a significant source of methane into 
the atmosphere, and the observed decline in the growth rate of atmo- 
spheric methane** parallels the decline in atmospheric ethane levels 
during 1980-2000. This evidence suggests that the decline in methane 
growth rates was caused by a gradual reduction in fossil-fuel emissions, 
which was already underway when continuous direct atmospheric 
measurements of methane started in 1984 and 1985°*. The stabiliza- 
tion of atmospheric ethane in recent years* (also see Supplementary 
Information) suggests that fossil-fuel emissions are now steady. Thus, 
the recent increases in atmospheric methane levels are probably not 
derived from increased fossil-fuel emissions and must be due to other 
sources, as previously suggested'*”°. 

The ethane-based biomass-burning emission history agrees well 
with some independent estimates®’®. However, our fossil-fuel emis- 
sions history is quite different from bottom-up methane emission 
inventories’ (Fig. 3). Most notably, the ethane-based fossil-fuel emis- 
sions display a steep ramp-up after 1920 anda sharp decline after 1980, 
whereas the bottom-up methane fossil-fuel emission inventories display 
a generally increasing trend through the entire twentieth century, with 
the sharpest increases occurring between 1950 and 1980. Hydrocarbons 
are emitted from a variety of fossil-fuel sources, each exhibiting a dif- 
ferent range of methane/ethane emission ratio (MER). For example, 
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Figure 2 | Ethane source emissions and the resulting atmospheric histories. 
a-d, Historical global ethane emissions (a, b) and the resulting atmospheric 
histories (c, d) are derived with the two-box model to be consistent with the 
site-specific atmospheric histories based on firn-air results (Fig. 1). Fossil-fuel 
emission histories were developed by considering five different biomass- 
burning cases. In the first four cases, biomass-burning emissions are fixed at 1, 
2,3 and4Tg yr! (black, purple, green and blue lines, respectively; a-d), and 
the fossil-fuel emissions are varied to minimize the 7” value of the fit to the firn- 
air-based ethane atmospheric histories (black squares in c and d) from both the 
high northern latitudes and high southern latitudes (HNL and HSL) during 
1950-2000, and from HSL only during 1900-1940. We included 2010 in the 
box model optimizations by assuming the atmospheric ethane levels remained 
constant at the mixing ratios used as surface tie-points used in firn inversions 
(Fig. 1). As a fifth case, the fit to the atmospheric histories is optimized by 
allowing both the fossil-fuel and the biomass-burning emissions to vary (thick 
red lines). Variable biomass burning is considered only for the period since 
1950, because atmospheric histories from both hemispheres are needed to 
constrain the partitioning between fossil-fuel and biomass-burning emissions. 
Emissions from biofuel burning are fixed at the historical estimates (dashed 
black line; a) in all five cases*. The atmospheric lifetime of ethane (Methods 
Summary) is also the same in all five cases. The hemispheric-average 
atmospheric histories are also shown for the fifth case (dashed red lines; c and 
d), in which fossil-fuel and biomass-burning emissions are both varied. Ethane 
levels in HNL and HSL are obtained from Northern and Southern hemispheric 
(NH and SH) levels using the following ratios for fossil-fuel, biofuels and 
biomass-burning emissions: HNL/NH: 1.6, 1.4, 1.0; HSL/SH: 0.7, 0.7, 0.7. 
Values differ from 1.0 owing to the latitudinal distribution of each type of 
emission (Supplementary Information). 


coal and natural gas emissions are methane-rich (MER = 5-100) with 
respect to emission ratios measured near oil-storage and oil-processing 
facilities (MER = 3-5)’. It is possible that the global-average MER (the 
ratio of total methane to total ethane emissions from all fossil-fuel 
sources combined) changed over time as the relative emission strengths 
of the different fossil-fuel sources varied during the twentieth century, 
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Figure 3 | Ethane and methane emissions from fossil fuels, biofuels and 
biomass burning. a, Ethane-based fossil-fuel emission histories (left y-axis) for 
fixed biomass burning (solid grey lines) and variable biomass burning (solid 
black line) cases, compared with bottom-up methane emission estimates from 
fossil fuels (coloured lines, right y-axis) based on the following sums of 
emissions: blue, gas flaring, gas supply and coal mining’; red, fossil-fuel 
consumption, fossil-fuel production and industrial’; green, oil, gas and solid 
fuels (data from http://edgar.jrc.ec.europa.eu/). The oil and gas only methane 
emissions from EDGAR4.1 (see URL in previous sentence; green dashed line) 
and gas flaring and gas supply methane emissions from ref. 1 (blue dashed line) 
are also shown. The trends in the ethane-based histories are relatively 
insensitive to assumptions about biomass burning. None of the bottom-up 
methane fossil-fuel emission inventories display a persistent decline during 
1980-2000 that would be consistent with the ethane-based emission histories. 
The scales of the left and right y-axes are set at a ratio of 8 because it suits the 
presentation well and is consistent with b. b, Ethane emissions (left y-axis) from 
biomass burning (dashed black line) and biomass-burning+ biofuels (solid 
black line) for the variable biomass-burning case in the two-box model 
compared with methane emissions (right y-axis) attributed to the total burning 
product from ref. 1 (blue line)’ and with EDGAR-HYDE 1.4 (red line)’. The 
scales of the left and right y-axes are set at a ratio of 8, which is a measurement- 
based estimate of methane/ethane emission ratio (MER) from biomass 
burning”. Our results and the bottom-up methane inventories are consistent in 
how hydrocarbon emissions from biomass burning changed over time and 
suggest an MER of 8 on a Tg per Tg basis, in agreement with the earlier 
estimates”. Our estimate of ethane biomass-burning emissions from the 
variable biomass-burning case is also in good agreement with some 
independent estimates of historical ethane emissions from biomass burning’® 
(green diamonds, left y-axis) and a total burning emissions estimate (biomass 
burning + biofuels) for present day® (red triangle, left y-axis). 


but the apparent differences between the ethane-based fossil-fuel emis- 
sion histories and inventory-based methane emissions (Fig. 3) cannot be 
reconciled with such considerations. Inconsistencies are already appar- 
ent in mid-century, when the ratio of bottom-up methane emissions to 
ethane-based fossil-fuel emissions imply an MER of 2-3 (Fig. 3). It is 
highly unlikely that the global-average MER during the 1950s could be 
lower than point sources with the lowest MERs, especially considering 
that both coal and natural gas were sources of significant hydrocarbon 
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emissions during that period. This suggests the bottom-up methane 
inventories underestimate fossil-fuel emissions at mid-century. 

The ethane-based estimates of fossil-fuel emissions show a decline 
during the 1980s, while the methane-inventory-based estimates of 
fossil-fuel emissions show an increase (Fig. 3). Opposing trends in the 
fossil-fuel emissions of ethane and methane are highly unlikely, because 
this requires ethane to be selectively removed from both newly intro- 
duced and existing hydrocarbon sources while the residual methane is 
released to the atmosphere. Natural gas is used in the production of 
feedstock ethane in plastics manufacturing. However, it seems 
implausible that methane, which is economically the most valuable 
component of natural gas, would simply be vented back into the atmo- 
sphere after ethane had been removed from natural gas. In fact, the 
amount of feedstock ethane produced from natural gas is inversely 
related to the price of natural gas’’, and probably declined as natural 
gas became progressively more valuable during the second half of the 
twentieth century. We conclude that the discrepancy between the 
ethane-based fossil-fuel emission histories and the fossil-fuel emission 
histories from the bottom-up methane inventories cannot be explained 
by realistic changes in global-average MER during the twentieth 
century. 

The decline in fossil-fuel emissions, as calculated from our firn-air 
ethane records, coincides with a period of rapid expansion in natural 
gas production during the second half of the twentieth century'*”’. We 
speculate that the rising economic value of natural gas during the late 
twentieth century’ and the development of cleaner technologies led to 
sharp reductions in the release of light hydrocarbons into the atmo- 
sphere. Emissions linked specifically to the growing natural gas industry 
must have been more than offset by large reductions in the venting of 
light hydrocarbons, including methane and ethane, associated with 
production and processing of petroleum. These changes appear to be 
underestimated in the bottom-up methane inventories. We estimate 
that the total decline in fossil-fuel emissions of ethane was 5-6 Tg yr * 
during 1980-2000. Attributing this decline entirely to decreases in 
fossil-fuel emission sources with the lowest MER (3-5) implies a 15- 
30Tgyr ' drop in fossil-fuel emissions of methane. Distributing the 
decline among different fossil-fuel sources would yield a larger change 
in methane emissions from fossil fuels. An independent inversion ana- 
lysis of methane observations suggests that a 20 Tg yr * drop in fossil- 
fuel methane emissions during the 1990s contributed to the decline in 
methane growth rates’. Our results are consistent with such a decline, 
but provide additional evidence that the drop in fossil-fuel emissions 
started sooner, by about a decade. The decline in fossil-fuel emissions 
during the 1990s accounts for less than 60% of the total reduction 
during 1980-2000 in the two-box model results for ethane, suggesting 
that the reduction in fossil-fuel methane emissions could be ~40% 
larger than the previous estimate” of 20 Tg yr '. 

Given that we do not have comparable observations from lower 
latitudes, it is possible that a shift in the location of fossil-fuel emissions 
towards lower latitudes within the Northern Hemisphere contributed 
to the observed ethane decline during 1980-2000. However, we 
estimate this effect to be relatively small on the basis of sensitivity tests 
with the UCI-CTM, which show that possible changes in the location 
of the emission reductions for ethane could add about 20% to the 
magnitude of emission reductions calculated above (see Supplemen- 
tary Information). 

It is also possible that the late twentieth century decline in ethane 
could have been caused by a decrease in the atmospheric lifetime of 
ethane, but we estimate the likelihood for such changes to be low. 
Assuming fixed emissions, a 5-6Tgyr ' increase in ethane loss 
(roughly one-third of the peak budget) would be required to account 
for the observed ethane decline. A change of this magnitude due to 
increases in OH" concentrations during the late twentieth century is 
unlikely”’”’; however, atmospheric levels of chlorine atoms (Cl*) might 
have increased, because of increasing tropospheric NO,, increasing 
tropospheric ozone and increasing acidity of aerosols*”*. A Cl° sink 
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of the magnitude required to explain the ethane decline would have 
a minor effect on the methane budget (~3%), because the relative 
reactivity of Cl° versus OH® (kq/koy) is considerably smaller for 
methane than it is for ethane. Accounting for the ethane decline would 
require an increase of roughly 1.1 X 10* Cl° cm * over the entire 
marine boundary layer, or a much larger increase if just the polluted 
boundary layer is considered. Estimates for the clean marine boundary 
layer are in the range 10°-10*CI° cm ®. It is not possible to assess 
whether a change of the required magnitude is viable with the limited 
observational data available at present**”>. 


METHODS SUMMARY 

Firn-air measurements, modelling and inversion. At all three sites, multiple 
flasks (see Supplementary Information) from the same depth were filled sequen- 
tially, using established methods’’. All flasks were analysed at the University of 
California, Irvine, using ~100 cm? STP samples on a gas chromatography-mass 
spectrometry (GC-MS) system designed for trace gas analysis on small air sam- 
ples*®. Summit and South Pole flasks were analysed at least twice. We used a one- 
dimensional firn-air model and a synthesis inversion method to derive atmospheric 
histories of the annual-mean, high-latitude, tropospheric abundances of ethane 
from the firn-air measurements (see Supplementary Information). 

Two-box model. The Northern and Southern hemispheres are represented as 
equal mass boxes of 2.2 X 10” Tg (exchange time 1 year). Ethane is lost through 
OH” oxidation (14.6 months ') and transport into the stratosphere (1/35 yr‘), 
yielding a present day lifetime of 2.3 months. The annual-mean OH” loss frequency 
varies over the past 100 years based on the methane feedback on tropospheric OH": 
—0.32% in OH® for every +1% in methane”. The OH” changes differently in 
Northern and Southern hemispheres: —0.305% and —0.335%, respectively, based 
on simulations with the UCI-CTM'”. We adopt a methane increase of 900 p.p.b. 
to 1,790 p.p.b. since 1900”. Fossil-fuel, biofuel and biomass-burning sources in the 
model Northern Hemisphere are adopted from previous work’ and represent 93%, 
81% and 58%, respectively, of the total emissions. The fossil-fuel emission histories 
for various biomass-burning scenarios are developed by an inverse optimization 
algorithm (see Supplementary Information). 

Three-dimensional model. The UCI-CTM’s simulation of ethane with realistic 
distribution of sources” is used to relate the northern and southern tropospheric 
mean abundances to the annual-mean abundance over the ice sheets, and thus 
provide a correction to the two-box model histories for the firn-air modelling. 
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C4 grasses prosper as carbon dioxide eliminates 
desiccation in warmed semi-arid grassland 
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Jana Heisler-White*, Feike A. Dijkstra’ & Mark West! 


Global warming is predicted to induce desiccation in many world 
regions through increases in evaporative demand*”’. Rising CO, 
may counter that trend by improving plant water-use efficiency*’. 
However, it is not clear how important this CO,-enhanced water use 
efficiency might be in offsetting warming-induced desiccation 
because higher CO, also leads to higher plant biomass, and therefore 
greater transpirational surface”®’. Furthermore, although warming 
is predicted to favour warm-season, C, grasses, rising CO2 should 
favour C3, or cool-season plants*. Here we show in a semi-arid grass- 
land that elevated CO, can completely reverse the desiccating effects 
of moderate warming. Although enrichment of air to 600 p.p.m.v. 
CO, increased soil water content (SWC), 1.5/3.0 °C day/night warm- 
ing resulted in desiccation, such that combined CO, enrichment and 
warming had no effect on SWC relative to control plots. As pre- 
dicted, elevated CO, favoured C3 grasses and enhanced stand pro- 
ductivity, whereas warming favoured C, grasses. Combined 
warming and CO, enrichment stimulated above-ground growth of 
C, grasses in 2 of 3 years when soil moisture most limited plant 
productivity. The results indicate that in a warmer, CO2-enriched 
world, both SWC and productivity in semi-arid grasslands may be 
higher than previously expected. 

Grass-dominated, dry rangelands account for over 30% of Earth’s 
terrestrial surface”’® and provide most of the forage for the world’s 
domestic livestock. Among the most important of these include the 
vast Central Asian steppes, North American prairies, Australian 
rangelands, plus extensive grazing lands of Africa, South America 
and the Mediterranean. Productivity of these lands, which is under 
increasing pressure with human population growth, is primarily limited 
by water®". Soil water availability is driven by complex interactions 
between precipitation, temperature’”’, ambient CO (ref. 4) and soil 
properties, rendering rangelands and the livestock and cultures they 
support particularly susceptible to climate change and rising COb. 
Higher CO, concentrations induce stomatal closure, which reduces leaf 
transpiration and increases plant water-use efficiency and SWC*. 
Despite agreement among global climate models that climate change 
will induce drought-like conditions in several rangeland-dominated 
world regions’”, considerable uncertainty remains as to how stomatal 
closure from CO, might interact with increased temperature to influ- 
ence stand-level evapotranspiration, soil water dynamics and plant 
productivity. Many believe that CO2-induced reductions in transpira- 
tion at the leaf level will be largely offset at the canopy level by increases 
in leaf area*®”", 

Most rangelands are dominated by graminoids, and contain a mix- 
ture of the two major photosynthetic pathway classes of plants, C3 and 
C, Today, Cy grasses are most common in grasslands and savannas 
within 40° of the Equator®, and are predicted to become more abund- 
ant with increasing temperature’. However, C; plants, owing to their 
CO,-limited photosynthetic metabolism, may become more compe- 
titive in future CO-enriched rangelands*"*. The relative abundances 


of these two photosynthetic types govern spatial and temporal aspects 
of water and nutrient cycling, net primary production and plant- 
animal interactions'*’*'®, yet there is considerable uncertainty about 
which group will be favoured under future, warmer, CO -enriched 
conditions. 

In the Prairie Heating and CO, Enrichment (PHACE) experiment, 
we evaluated the responses of native mixed-grass prairie to 1 year of 
CO, enrichment (2006) (from present ambient (385 p.p.m.v.) to ele- 
vated (600 p.p.m.v.) CO;, treatments c and C respectively), followed by 
3 more years (2007-2009) of combined CO, enrichment and warming 
(present ambient, and elevated, temperatures (1.5/3.0°C warmer 
canopy day/night), treatments t and T respectively) using free-air 
CO, enrichment (FACE) technology for CO, (ref. 17) and T-FACE 
technology for warming’* (Supplementary Appendix I). CO; enrich- 
ment and warming had opposing effects on soil water balance. 
Elevated CO, (Ct) increased annual SWC (17.3% averaged over 
2007-2009), whereas warming reduced it (13.1%), such that there 
was no difference between control (ct; 15.5%) and CO;-enriched 
and warmed plots (CT; 15.6%) (Supplementary Appendix II). The 
same pattern of compensating treatment effects on SWC was observed 
in the early growing season (Fig. 1), the time of year when SWC most 
controls annual productivity in this grassland’”. These results are similar 
to but more consistent than those reported in a semi-arid Australian 
temperate grassland”°, and illustrate that the water conservation effects 
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Figure 1 | Responses of SWC to CO, and warming. Average and s.e.m. (error 
bars) of volumetric SWC (5-25 cm depth) for plots exposed to present-day 
ambient CO, and temperature (ct), 1.5/3 °C day/night warming (cT), 

600 p.p.m.v. CO; (Ct), and 600 p.p.m.v. CO, and 1.5/3 °C day/night warming 
(CT) (five replications per treatment). Data are averaged over days of year 
(DOY) 100-200, the early- to mid-growing season when soil water most limits 
productivity. Precipitation amounts for this same period are also presented. 
Significance (P = 0.05) for main effects and year are given in the figure. 


1USDA-ARS, Rangeland Resources Research Unit and Northern Plains Area, Fort Collins, Colorado 80526, USA. Department of Botany and Program in Ecology, University of Wyoming, Laramie, Wyoming 
82071, USA. 3US Arid-Land Agricultural Research Center, USDA, Agricultural Research Service, Maricopa, Arizona 85238, USA. Departments of Botany, Renewable Resources, and Program in Ecology, 
University of Wyoming, Laramie, Wyoming 82071, USA. °Faculty of Agriculture, Food and Natural Resources, The University of Sydney, Sydney, New South Wales 2006, Australia. 
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Figure 2 | Plant biomass responses to CO, and warming. Treatment effects 
(details in Fig. 1) on biomass in mid-July. AGB and s.e.m. (error bars) are given 
for C, (a) and C; grasses (b), forbs and sub-shrub (c) and total plant community 
(d). Below-ground biomass and s.e.m. (error bars) is given for soil depths of 0-5 


of elevated CO, can completely cancel the desiccating effects of 
moderately warmer temperatures. 

Exposure of the prairie to 600 p.p.m.v. CO, increased peak total 
above-ground biomass (AGB, an estimate of above-ground net primary 
productivity) by an average 33% (Fig. 2d) in the first 3 years of the 
experiment when annual precipitation amounts were within 7% of 
the site 132-year average of 388 mm. We attribute this strong, positive 
response of AGB to CO; in this semi-arid grassland to the higher SWC 
in CO,-enriched plots*. CO, enrichment had no effect on AGB in 2009, 
a relatively wet year for the site when annual precipitation was 17% 
higher than the long-term mean (Supplementary Appendix II). We 
suspect that higher SWC in 2009 (Fig. 1) minimized the potential 
water-relations benefit of CO, enrichment on plant productivity. 

To evaluate further the effect of CO, on SWC and plant productivity, 
we examined how responses of AGB to increasing CO, varied with 
early-season soil matric potential (i/,,) averaged during periods of 
active growth (i, = —1.5 MPa). The ratio of AGB of plants exposed 
to 600 p.p.m.v. CO; to that of plants exposed to present-day CO} (bio- 
mass enhancement ratio) was used for this evaluation. We incorporated 
results from another CO, enrichment experiment in the Colorado 
shortgrass steppe to include the other semi-arid grassland of the western 
Great Plains*' (Supplementary Appendix I, Supplementary Fig. 1). A 
strong negative relationship was observed between w,, and biomass 
enhancement ratio (Fig. 3), which we attribute to (1) the enhancement 
of water use efficiency under elevated CO, and increasing water stress*”, 
and (2) the use of a physiologically relevant environmental driver (/., 
rather than SWC; Supplementary Appendix I, soil water conversions) 
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(e), 5-15 (f) and 15-30 (g) cm. Results from 2006 include only the CO, 
treatment. Significant contrasts between present-day (ct) and future (CT) 
conditions are represented above histograms as ct/CT. P values are given for 
analyses of main effects, interactions and the ct/CT contrast when P = 0.10. 
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Figure 3 | Response of biomass enhancement ratio to soil matric potential. 
Effects of early-season (DOY 100-200) rooting zone soil matric potential (/.,) 
on biomass enhancement ratio, the ratio of mid-July harvested above-ground 
plant biomass in CO,-enriched plots divided by plant biomass from ambient 
CO, plots. Data are from the PHACE experiment (4 years’ data, n = 4), and 
from a previous open top chamber CO, enrichment experiment (5 years’ data, 
n = 5) conducted on Colorado shortgrass steppe’'. For further details, see 
Supplementary Appendix I, soil water conversions. 
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averaged over periods of physiological activity. This robust relationship 
from two different ecosystems suggests that CO, will increase plant 
productivity most when plants are water-limited yet still active. 

The positive effect of CO on production was also apparent in 
below-ground plant biomass. However, because root turnover in 
semi-arid perennial grasslands occurs every 5-7 years (ref. 22), we 
did not detect a significant (P = 0.05) effect on roots until the fourth 
year of our experiment (2009), when below-ground plant biomass in 
the 0-5 cm depth layer was 32% higher in CO2-enriched plots (Fig. 2e). 

Warming has the potential to increase production by extending the 
duration of the growing season, and by increasing nitrogen uptake, 
efficiency of nitrogen use” and biological activity. In our experiment, 
however, warming by itself did not significantly affect total AGB 
(Fig. 2d) or below-ground plant biomass (Fig. 2e-g), despite increased 
plant nitrogen content”. Warming-induced desiccation in this semi-arid 
mixed-grass prairie (Fig. 1) and the preponderance of cool-season C3 
grasses (55% of AGB) probably limited its ability to respond positively to 
warming. 

Although water was a dominant driver in the responses of both C, 
and Cy, grasses to CO2 and temperature, there was also evidence of 
unique functional group responses. C3 grass AGB was 34% greater in 
CO,-enriched compared with ambient plots from 2006 to 2008 
(P< 0.05 in 2006 and 2008; P = 0.1 in 2007) (Fig. 2b), but did not 
respond to warming. In contrast, AGB of Cy grasses was 28% greater in 
CO,-enriched plots in the first 2 years, but was also greater under 
warming (P = 0.10), most notably the wet year (2009) when C, AGB 
was 23% greater in heated plots (P = 0.05) (Fig. 2a). Comparing pre- 
sent (ct) to future (CT) conditions, C3 grasses did not respond, but Cy 
AGB significantly increased with the combination of warming and 
CO, enrichment in 2007 and 2008 (Fig. 2). These results are consistent 
with our physiological understanding of C3; and Cy, plant functional 
types*’*, but they demonstrate for the first time in a field experiment a 
potential advantage for Cy grasses in a CO -enriched and warmer 
future. Further research will be needed to determine the extent to 
which these results can be applied to other C3 and Cy, grasses and 
grassland ecosystems (Supplementary Appendix III, Cy versus C; 
responses). Small population sizes and high year-to-year variability 
limited our capacity to detect treatment effects on forbs and sub- 
shrubs (Fig. 2c). 

Our results suggest that even with modest increases in plant growth 
(2007), rising atmospheric CO, may counter the pervasive desiccating 
effects of warming in semi-arid grassland ecosystems. The most likely 
explanation is a CO2-induced suppression of transpiration*. A model- 
ling analysis using our site’s weather data shows the effects of different 
hypothetical increases in canopy resistance to water loss (r,) such as 
might occur from stomatal closure, and different temperature increases 
on grass reference evapotranspiration rate (ET,.g Fig. 4), the evapo- 
transpiration rate of non-stressed grassland. Differences in ET,.¢ reflect 
not only the rate of evapotranspiration when soil water is readily avail- 
able to plants, but also how long grass can continue to transpire and 
grow after major rain events before exhausting its soil water supply. At 
ambient CO,, a daytime warming of 1.5 °C (our cT treatment daytime 
temperature; roughly 90% of evapotranspiration occurs during day- 
time) is predicted to increase ET,.¢ by 7%. At ambient temperature, an 
increase in r, of 30% would be required to decrease ET ¢ by a similar 
amount, 6%. This condition is similar to our Ct treatment (Fig. 4); leaf 
gas exchange measurements on the dominant C3 and Cy, grasses of 
this* and other natural ecosystems suggest a 30-40% increase in 
stomatal resistance occurs in native grasses under CO,-enriched and 
non-stressed conditions. Thus, when daytime warming of 1.5 °C is 
combined with an increase of 30% in r, (our CT treatment), the tem- 
perature effect and the CO, effect through increased stomatal resist- 
ance almost exactly offset one another, so that an increase of about only 
1% in ET,.¢ is predicted. This modelling result mirrors the observed 
offsetting effects of CO, and temperature on SWC (Figs 1 and 
Supplementary Figure 4). The analysis does not account for treatment 
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Figure 4 | Percentage changes in ET,, for a grass surface as affected by 
temperature and changes in r,. Percentage changes in ET,-¢ calculated using 
the American Society of Civil Engineers standardized evapotranspiration 
equation versus a range of temperature increases following Kimball”. 
Calculations used observed weather data from the PHACE project for the 1 
April — 16 October 2007, 2008 and 2009 growing seasons. The total calculated 
ET, ¢for the three seasons was 2,490 mm (average of 5.1 mm d~'). Calculations 
were done for zero, +15, +30 and +45% changes in canopy resistance to water 
vapour (r,), as might be expected from CO,-induced stomatal closure. For 
further details, see Supplementary Appendix III, Supplementary Fig. 5. 


effects on leaf area, although such differences were probably small 
compared with 30-40% increases in stomatal resistance we would 
expect under CO, enrichment (Supplementary Appendix III, evapo- 
transpiration modelling). A protracted drought that depletes soil water 
stores will of course eventually eliminate these CO,-induced water 
savings; we have yet to experience such conditions in this experiment. 
Thus, although rising CO, may ameliorate the effects of warming- 
induced drought through plant stomatal closure, it is unlikely to 
eliminate severe and protracted droughts predicted for rangelands in 
regions like southwestern North America or the Mediterranean, where 
both substantive temperature increase and reduced precipitation are 
predicted’”. 

The particular responses reported here are to a single set of simulated 
global change conditions that are within the prediction possibilities of 
the Intergovernmental Panel on Climate Change for the latter half of 
this century’. The manipulations are not without their artefactual 
issues, which must be considered in scaling such information tem- 
porally and beyond the experimental site (Supplementary Appendix 
Ill, warming & CO, methodologies). Nevertheless, our results clearly 
illustrate the importance of compensating CO, and warming effects in 
semi-arid ecosystems. These contrasting, water-mediated effects must 
be accounted for in accurately predicting the susceptibility of such 
systems to climate change. 


METHODS SUMMARY 
Experiment. The experiment was conducted in northern mixed-grass prairie west 
of Cheyenne, Wyoming, USA. It had a factorial combination of two levels of CO2 
(385 p.p.m.v. and elevated 600 p.p.m.v.), and two temperature (present ambient, 
and elevated (1.5/3.0 °C warmer day/night)) regimes, with five replications each. 
Treatments were randomly assigned to 20 3.3-m diameter circular plots in two soil 
types (blocks). Details of the experimental site and set up are available elsewhere”. 
AGB was measured by species in 2005 (the year before CO, treatments), in 2006 
(the first year of CO2 treatments) and from 2007 to 2009 (the years of combined 
elevated CO, and warming treatments) during mid-July. A metal wire grid divided 
into 24-25 cm X 25cm quadrats (1.5 m? total) was placed over each plot and 
vegetation in every other quadrat (12 in total) was clipped to the crown, dried 
at 60 °C and weighed. Plant species were subsequently grouped into three func- 
tional groups for analyses: C4 grasses, C3 grasses, and Forbs and sub-shrub. 
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Beginning in 2007, three soil cores (30 cm deep, 5 cm diameter) were collected at 
about the same time as the AGB samples from each plot, divided into 0-5, 5-15 
and 15-30 cm depths, and the three core samples composited into one sample per 
depth. Fine roots (less than 1 mm) were handpicked from the composited samples, 
and root fragments were washed, weighed and ash-corrected. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Site description. The experiment was conducted at the US Department of 
Agriculture-Agricultural Research Service High Plains Grasslands Research 
Station, west of Cheyenne, Wyoming, USA (41° 11’ N, 104° 54’ W), elevation 
1930 m. The ecosystem is a northern mixed-grass prairie, with a plant community 
of about 55% cool-season C3 grasses (mostly Pascopyrum smthii and Hesperostipa 
comata, both perennials), 25% warm-season Cy, grasses (almost exclusively the 
perennial Bouteloua gracilis), and 20% sedges, forbs and small shrubs. Annual 
precipitation is 384mm, mean air temperatures are 17.5°C in summer and 
—2.5°C in winter. The average annual wind speed is 6ms_ ', with a growing 
season wind speed of 4.1ms_'. 

A 2.4-ha site, which had been grazed by cattle at least since 1974, was fenced to 
prevent cattle entering in 2005. Soils are an Ascalon variant loam (fine-loamy, 
mixed-mesic) at the north end of the study site, and an Altvan loam (fine-loamy 
over sandy, mixed-mesic) at the south end. 

Experimental layout. The experiment has a factorial combination of two levels of 
CO) (present ambient (385 p.p.m.v.), and elevated (600 p.p.m.v.), c and C respec- 
tively), and two temperature (present ambient, and elevated (1.5/3.0 °C warmer 
day/night), t and T respectively) regimes, with five replications making 20 experi- 
mental plots. Assuming an annual increase of CO; concentration of 2.3 p.p.m.v., 
our treatment combinations represent CO, and temperatures expected near the 
end of this century. We chose differential day/night temperatures based on the 
Fourth Intergovernmental Panel on Climate Change assessment, which suggests 
that daily minimal temperatures will increase faster than maximal temperatures”. 
Furthermore, the equilibrium global mean surface air temperature for a doubling 
of atmospheric CO is likely to lie between 2 and 4.5°C. For our target CO, 
concentration of 600 p.p.m.v., our average day/night temperature of 2.25 °C lies 
at the upper third of a range we calculate to be between approximately 1.1-2.5 °C. 
The four treatment combinations were designated as follows: ct, ambient CO, 
non-heated; Ct, elevated CO;, non-heated; cT, ambient CO3, heated; CT, elevated 
COs, heated. Treatments were randomly assigned to 20 plots in northern and 
southern soil-type field blocks. Details are available elsewhere”, including the 
FACE” and infrared warming’® apparatus used to alter ambient CO, and tem- 
perature (see also Supplementary Appendix I). 

Experimental rings and microclimate. To accommodate the FACE system, 
experimental plots were circular with a diameter of 3.3m (area of 8.6m”). 
Before instrumentation was installed, hexagonal trenches 60cm deep were dug 
around the circumference of each plot and a plastic barrier installed to isolate 
treated plots hydraulically from outside non-treated soils. This hydraulic separa- 
tion helped maintain any water-relations effects of the treatments. 

Most of the plot area was maintained as native northern mixed prairie and was 
partitioned into soil and plant sampling areas, plus sections for other measure- 
ments, including SWC and air and soil temperature. Volumetric SWC was 


measured daily using frequency domain reflectometry sensors at 10 and 20cm 
soil depths (Sentek Envirosmart sensors, Sentek Sensor Technologies) placed 
approximately 75 cm from the ring centres. Daily total precipitation was recorded 
with a rain gauge (Onset corp. S-RGA-M002). 

Plant biomass. AGB was measured by species in 2005 (the year before CO) treat- 
ments), in 2006 (the first year of CO; treatments) and from 2007-2009 (the years of 
combined elevated-CO, and warming treatments) during mid-July. Mid-July is the 
approximate time of peak seasonal AGB, and provides a good estimate of above- 
ground net primary production. A metal wire grid divided into 24-25 cm X 25cm 
quadrats (1.5 m* total) was placed over each plot (all sampled vegetation being 
within a metre of the ring centre), and vegetation in every other quadrat (12 in total) 
was clipped to the crown, dried at 60°C and weighed. This defoliation protocol 
removed 50% of the green vegetation and represented the grazing removal for these 
grasslands. To correct for initial plot differences in species abundance, the 2005 pre- 
treatment harvest data were used to calculate overall means of each plant species’ 
biomass across all 20 plots, plus the deviation for each species from that mean in 
every plot. These deviations were either added to or subtracted from individual 
species’ biomass data collected in all plots for the subsequent treatment years, 
2006-2009, thereby correcting for initial plot differences in plant species abund- 
ance. Plant species were subsequently grouped into three functional groups for 
analyses: C, grasses, C; grasses and sedges, and forbs and sub-shrub. 

Beginning in 2007, near the time of the peak standing AGB harvest, three soil 

cores (30 cm deep, 5 cm diameter) were collected from each plot, divided into 0-5, 
5-15 and 15-30 cm depths, and composited into one sample per depth. Fine roots 
(less than 1 mm) were handpicked from the composited samples, and root frag- 
ments were washed, weighed and ash corrected. 
Statistical analyses. Data analysis used SAS/STAT software, version 9.2, Proc 
GLIMMIX, © 2002-2008 SAS Institute. Mean AGB and below-ground plant bio- 
mass were compared among all four treatments (five replications each) and years 
by fitting a repeated-measures general linear model to biomass. Plots within 
treatments represented subjects on which repeated measures were taken. 
Treatments and year were fitted as fixed-effects. Contrasts were constructed to 
estimate and test the effects of CO,, warming and their interaction on biomass 
within years, owing to climatic differences among the four years. We hypothesized 
that increasing CO2 would increase plant biomass (one-tailed test), but that warm- 
ing would have no net effect on biomass (two-tailed test). Analyses of mean SWC 
(averaged annually for days of year 100-200) were conducted across years 2007-— 
2009 when both CO, and warming treatments were operational. Regression curves 
of SWC and y,,, with biomass enhancement ratio using data from the PHACE and 
open top chamber experiments were computed using JMP software, version 8.0.1, 
2009 SAS Institute. Throughout, a P value of = 0.05 is the standard for indicating 
significance, although marginal significance (P = 0.10) is reported in figures, and 
sometimes commented on in the text as deemed appropriate. 
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Initial radiation of jaws demonstrated stability 
despite faunal and environmental change 


Philip S. L. Anderson!, Matt Friedman”, Martin D. Brazeau*+ & Emily J. Rayfield! 


More than 99 per cent of the roughly 58,000 living vertebrate species 
have jaws’. This major clade, whose members are collectively known 
as gnathostomes (‘jawed mouths’), made its earliest definitive 
appearance in the Silurian period, 444-416 million years (Myr) 
ago, with both the origin of the modern (crown-group) radiation 
and the presumptive invasion of land occurring by the end of the 
Devonian period’ (359 Myr ago). These events coincided with a 
major faunal shift that remains apparent today: the transition from 
Silurian ecosystems dominated by jawless fishes (agnathans) to 
younger assemblages composed almost exclusively of gnathos- 
tomes*’. This pattern has inspired several qualitative descriptions 
of the trophic radiation and ecological ascendance of the earliest 
jawed vertebrates* ’. Here we present a quantitative analysis of func- 
tional variation in early gnathostome mandibular elements, placing 
constraints on our understanding of evolutionary patterns during 
this critical interval. We document an initial increase in functional 
disparity in the Silurian that stabilized by the first stage of the 
Devonian, before the occurrence of an Emsian (~400 Myr ago) oxy- 
genation event implicated in the trophic radiation of vertebrates’. 
Subsequent taxonomic diversification during the Devonian did not 
result in increased functional variation; instead, new taxa revisited 
and elaborated on established mandibular designs. Devonian func- 
tional space is dominated by lobe-finned fishes and ‘placoderms’; 
high disparity within the latter implies considerable trophic innova- 
tion among jaw-bearing stem gnathostomes. By contrast, the major 
groups of living vertebrates—ray-finned fishes and tetrapods— 
show surprisingly conservative mandibular morphologies with little 
indication of functional diversification or innovation. Devonian 
gnathostomes reached a point where they ceased to accrue further 
mandibular functional disparity before becoming taxonomic domi- 
nants relative to “ostracoderm’-grade jawless fishes, providing a new 
perspective on classic adaptive hypotheses concerning this fun- 
damental shift in vertebrate biodiversity. 

Morphological disparity has often been used as a proxy for eco- 
logical/functional variety in the fossil record’. However, divergent 
anatomies can be biomechanically similar'’"’, leading to decoupling 
of morphological and functional metrics'*"*. To evaluate potential 
functional diversity, we have devised a measure of variation derived 
from biomechanical traits. We focus on the mandible because it is well 
characterized functionally in modern fishes’* and because it is the key 
character proposed by adaptive schemes to separate the gnathostomes 
from their jawless predecessors*®. We document how functional vari- 
ation accumulates after the origin of jaws and test the hypothesis that, 
during their rise to dominance, gnathostomes outcompeted or dis- 
placed jawless fishes. 

We assembled a data set consisting of 198 Silurian/Devonian 
gnathostome genera scored for 31 mandibular traits (11 continuous 
and 20 discrete), many of which have been shown to be functionally 
relevant in living taxa. These data were analysed using non-metric 
multidimensional scaling, yielding a multivariate ‘function space’. 
Functional disparity was calculated across eight time bins (the late 


Silurian plus the seven stages of the Devonian) using a variety of 
measures (sum of variances is presented here because it is relatively 
robust to variation in sample size’®). We tested for differences between 
successive bins using multiple approaches (likelihood ratios and 
t-tests), and used a jackknifing procedure that pruned exceptional 
fossil assemblages from our data set to examine the sensitivity of results 
to ‘Lagerstatten effects’, biases arising from localities characterized 
by exceptional preservation or fossil abundance. Full details of our 
methodology are given in Supplementary Information. 

Our results show remarkably stable levels of lower-jaw functional 
disparity during the Devonian (Fig. 1a). There is a trend of increasing 
disparity from the late Silurian to the end of the Early Devonian (the 
Emsian stage); however, by the earliest Devonian (the Lockhovian stage, 
~415 Myr ago) functional mandibular disparity attained a level statist- 
ically indistinguishable from that found in the Late Devonian. When P 
values are adjusted for multiple comparisons, the only significant dis- 
parity shift between successive Devonian stages (corrected P value, 
0.013) is a decline on entering the Middle Devonian (the Eifelian stage, 
~398 Myr ago; Fig. 1a). This seems to reflect a reduction in the number 
of marine sites sampled in the Eifelian, combined with the Emsian 
Taemas/Wee Jasper fauna, which contains a series of extensively studied 
durophagous lungfishes with exceptionally robust jaws. When this site is 
removed in jackknife analyses, the drop in disparity is no longer signifi- 
cant (corrected P value, 0.112). When differences are assessed using 
likelihood ratios, none exceeds a threshold value of 8 (ref. 17; likelihood 
ratio range, 1.023-4.34). This pattern of functional stability arises despite 
an increase both in the number of mandibles sampled per bin over our 
study interval and in the taxonomic diversity of gnathostomes during the 
Devonian’’, and is robust to the removal of exceptional Lagerstatten. 
Details of all analyses are given in Supplementary Information. 

A breakdown of morphospace by stage shows that initial functional 
disparity in the late Silurian was localized in the lower-left and upper- 
right quadrants (Fig. 2). The former is populated by gnathostomes (for 
example stem sarcopterygians and ischnacanthid ‘acanthodians’) with 
gracile mandibles bearing slender, pointed cusps and characterized by 
low closing mechanical advantages, whereas the latter contains taxa 
(for example mesacanthid acanthodians) with toothless jaws. The 
Lockhovian expansion into the lower-right quadrant reflects the appear- 
ance of taxa (for example the lungfish Diabolepis) with robust jaws 
characterized by both high closing mechanical advantage and high 
second moment of area, two features consistent with durophagy. 
Subsequent mandibular evolution during the Devonian resulted in novel 
designs that elaborated existing ones, but did not substantially exceed the 
boundaries established earlier in the interval (Fig. 2). Specific anatomical 
correlates of our ordination axes are provided in Supplementary Table 2. 

We examined patterns of partial disparity’” to dissect the contribu- 
tions to overall functional variation made by major taxonomic assem- 
blages: “Acanthodii’, an extinct assemblage sometimes called spiny 
sharks; Actinopterygii, or ray-finned fishes, related to most fishes alive 
today; Chondrichthyes, or cartilaginous fishes, the precursors of rays, 
sharks and ratfishes; ‘Placodermi’, armoured stem gnathostomes; and 
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Figure 1 | Functional mandibular disparity among Silurian/Devonian 
gnathostomes. All horizontal axes show time as indicated at bottom. 

a, Disparity (sum of variances) across eight time bins. The dark-grey region 
spans the 95% confidence intervals based on 1,000 bootstrap pseudoreplicates. 
b, Relative contributions (partial disparity) of major gnathostome groups to 
overall functional disparity. Orange, ‘Acanthodii’; green; Sarcopterygii; blue, 
‘Placodermi’; yellow, Chondrichthyes; red, Actinopterygii. c, Faunal 


Sarcopterygii, or lobe-finned fishes, including lungfishes and terrestrial 
vertebrates. Sarcopterygians make a major and relatively stable contri- 
bution to mandibular disparity throughout our study interval (Fig. 1b). 
The earliest definitive digit-bearing tetrapods (Acanthostega and 
Ichthyostega) and their immediate relatives (Ventastega, Elginerpeton, 
Densignathus and Metaxygnathus) show a conservative pattern in 
mandibular function relative to sarcopterygian ‘fishes’ (Fig. 2). This 
is in contrast to studies that report a distinct shift in cranial suture 
patterns between aquatic taxa and early digit-bearing tetrapods”, sug- 
gesting that different parts of the feeding system evolved in separate 
phases. By contrast, lungfishes are a major source of mandibular 
innovation among sarcopterygians, colonizing a new region of func- 
tional space in the Lockhovian and occupying it exclusively throughout 
the Devonian (Fig. 2). 

‘Placoderms’ occupy 50% of realized Devonian gnathostome mor- 
phospace volume (Fig. 2), indicating that considerable functional dif- 
ferentiation occurred on the gnathostome stem, independently of the 
origin of modern jawed vertebrates. ‘Placoderm’ functional variety 
remains undiminished until their sudden extinction at the Devonian/ 
Carboniferous boundary. A direct reading of our results implies that 
‘placoderms’ were a major contributor to gnathostome disparity only 
from the Middle Devonian onward. We argue that this is an artefact; 
many ‘placoderms’ are known from the Early Devonian and, increas- 
ingly, the Silurian*”’, but few preserve mandibular material that can be 
included in our analysis. This suggests that disparity measures for the 
Silurian and Early Devonian are probably conservative underestimates, 
and that the addition of more ‘placoderms’ of this age would only 
reinforce the pattern of early functional morphospace saturation. 

The remaining three gnathostome assemblages—‘acanthodians’, 
chondrichthyans and actinopterygians—make relatively minor 


composition data for the late Silurian and Devonian. Discs represent individual 
vertebrate assemblages plotted as a function of time and proportion of 
gnathostomes that comprise those faunas (discs jittered within time bins for 
clarity). The area of each disc is proportional to the total number of vertebrate 
genera represented, Mgenera. Eifel., Eifelian stage; Givet., Givetian stage; Loc., 
Lockhovian stage; Lu., Ludlow series; Pr., Pridoli series; Pra., Pragian stage. 


contributions to overall disparity (Fig. 1b). “Acanthodians’ make a 
major contribution to functional variety during the Silurian, but their 
relative importance wanes substantially throughout the Devonian, 
reflecting a decline in functional diversity and apparent richness com- 
bined with the taxonomic ascendance of other gnathostome clades’. 
Actinopterygians, the most speciose group of living jawed fishes’, con- 
tribute little to overall disparity in the Devonian, with their few repre- 
sentatives conservatively clustering within tightly delimited regions of 
function space (Fig. 2). This is consistent with previous arguments, 
drawn from taxonomic data, that actinopterygians were not ecologi- 
cally diverse until the Carboniferous period’’. The reliability of low 
measures of chondrichthyan partial disparity is unclear, however, 
because the numerous Devonian chondrichthyans known exclusively 
from dental remains” cannot be included in our analysis. 

Our results provide a new perspective on debates concerning the 
Devonian shift from agnathan- to gnathostome-dominated fossil assem- 
blages, a transition that remains apparent in modern vertebrate fauna’”’. 
A range of schemes, largely derived from anecdotal evidence, have 
sprung up in response to this pattern of turnover’ ’. Faunal data clearly 
show that gnathostomes shared habitat space evenly with ‘ostracoderm’- 
grade agnathans well into the Early Devonian (Fig. 1c). It is only during 
and after the Emsian that gnathostomes became taxonomic dominants 
in most fossil assemblages, several million years after they explored the 
extremes of realized Devonian mandibular function space. The offset 
between the trophic radiation of gnathostomes (Fig. 1a) and the taxo- 
nomic collapse of ‘ostracoderms’ (Fig. 1c), combined with the long 
history of coexistence between these two groups, argues against the 
direct ecological replacement of jawless fishes by jawed forms’. 
Furthermore, we find no evidence for a significant positive relationship 
between mandibular disparity within assemblages and the taxonomic 
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Figure 2 | Patterns of functional morphospace occupation for jawed 
vertebrates during the late Silurian and Devonian. Points represent averages 
for genera plotted on coordinate axes 1 and 2 of the non-metric 
multidimensional scaling ordination. Orange circles, ‘Acanthodii’; green down- 
triangles, Sarcopterygii (tetrapods indicated by larger symbols outlined in 


dominance of jawed vertebrates within those faunas (measured as the 
proportion of gnathostomes; Pearson and Kendall rank-order correla- 
tions, P = 0.90-0.94; weighted least-squares regression, P = 0.40- 
0.44). This pattern is inconsistent with the assertion® that gnathostome 
diversification primarily reflects the ecological displacement of 
agnathans and subsequent refilling of their functional roles. Indeed, 
few of the feeding strategies apparently taken up by early gnathos- 
tomes, such as macropredation and durophagy, have any clear ana- 
logue among jawless taxa’. Instead, the proliferation of gnathostomes 
probably reflects new ecological opportunities associated with the origin 
of jaws and, perhaps just as importantly, the shift from benthic lifestyles 
to pelagic ones”. 

The Devonian radiation of gnathostomes is marked by an early 
plateau of functional disparity and a consistent range of mandibular 
variation in the face of major environmental perturbations, including 
fluctuating global temperatures”, shifting continental weathering 
patterns”®, sea level changes”® and a major extinction event’’. Various 
proposals have linked patterns of early vertebrate evolution to changes 
in atmospheric composition, with parallel claims that gnathostome 
diversification was driven by increases in oxygen levels during the 
Silurian’® or the Devonian’. Silurian data are too scant to comment 
on the first proposal, but we note that gnathostomes achieved stable 
levels of functional diversity before the occurrence of an Emsian 
oxygenation event (~400 Myr ago) implicated in the trophic radiation 
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black); blue up-triangles, ‘Placodermi’; yellow diamonds, Chondrichthyes; red 
squares, Actinopterygii. The area in the lower-right corner of the functional 
morphospace is occupied solely by lungfishes. See text and Supplementary 
Information for further details. 


of vertebrates®. There is no significant shift in functional disparity 
between the Frasnian and Famennian stages (~375 Myr ago; Fig. 1) 
correlating with the Kellwasser event, which is classically identified as 
one of the ‘Big Five’ Phanerozoic extinctions”. This event resulted in a 
global reef collapse”; however, jawed vertebrates seem to be unaffected 
in terms of diversity, functional disparity and potentially ecological 
variation. More broadly, our results imply constraints on the func- 
tional diversity of the gnathostome mandibular system, with early 
functional saturation and subsequent refilling of established biome- 
chanical roles by new taxonomic actors. 
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Many of the cognitive deficits of normal ageing (forgetfulness, dis- 
tractibility, inflexibility and impaired executive functions) involve 
prefrontal cortex (PFC) dysfunction’*. The PFC guides behaviour 
and thought using working memory’, which are essential functions 
in the information age. Many PFC neurons hold information in 
working memory through excitatory networks that can maintain 
persistent neuronal firing in the absence of external stimulation’. 
This fragile process is highly dependent on the neurochemical 
environment’. For example, elevated cyclic-AMP signalling reduces 
persistent firing by opening HCN and KCNQ potassium channels*’. 
It is not known if molecular changes associated with normal ageing 
alter the physiological properties of PFC neurons during working 
memory, as there have been no in vivo recordings, to our knowledge, 
from PFC neurons of aged monkeys. Here we characterize the first 
recordings of this kind, revealing a marked loss of PFC persistent 
firing with advancing age that can be rescued by restoring an optimal 
neurochemical environment. Recordings showed an age-related 
decline in the firing rate of DELAY neurons, whereas the firing 
of CUE neurons remained unchanged with age. The memory- 
related firing of aged DELAY neurons was partially restored to 
more youthful levels by inhibiting cAMP signalling, or by blocking 
HCN or KCNQ channels. These findings reveal the cellular basis of 
age-related cognitive decline in dorsolateral PFC, and demonstrate 
that physiological integrity can be rescued by addressing the 
molecular needs of PFC circuits. 

Our society is rapidly ageing, with the number of seniors in the 
United States expected to double by 2050 (United States census, 
http://www.census.gov/population/www/pop-profile/elderpop.html). 
At the same time, the information age requires increasing organiza- 
tional skills to deal with even basic needs such as medical care and 
paying bills. However, executive and working memory functions 
decline early in the normal ageing process'*"”’, beginning in middle 
age'*'’. Thus, cognitive changes with advancing age may be costly, 
forcing retirement from demanding careers and jeopardizing the abil- 
ity to live independently in an increasingly complex society. Ageing 
monkeys provide an ideal model to reveal the neurobiology of normal 
ageing, as they have a highly developed PFC, but are not subject to age- 
related dementias’®. Thus, one can be certain that cognitive changes 
are the result of normal ageing and not incipient Alzheimer’s disease. 
Like humans, monkeys begin to develop deficits in executive function 
as early as middle age’’. Both aged monkeys’*"” and humans*”° are 
impaired on working memory tasks that require constant updating of 
the contents of memory (Supplementary Information), bringing to 
mind information from longer-term stores (for example, where did I 
leave my car keys this time?), or keeping in mind a recent event (for 
example, remembering a new phone number). 

In primates, spatial working memory depends on the highly evolved 
dorsolateral PFC® (Fig. 1a). Spatial working memory performance 
(Fig. 1b) relies on networks of pyramidal neurons that interconnect 
at dendritic spines (Fig. 1c), and excite each other to keep information 
‘in mind’, that is, generating persistent spiking activity over a delay 


period in a working memory task® (Fig. 1d). This ability to maintain 
information that is no longer in the environment is a fundamental 
process needed for abstract thought and flexible responding’. 
Intracellular signalling pathways modulate the physiological strength 
of these recurrent, excitatory PFC network connections’. Recent data 
show that increased cAMP signalling weakens network connectivity by 
opening potassium channels, whereas inhibiting cAMP signalling and/ 
or closing these channels strengthens connectivity and cognitive ability” 
(Fig. le). Specifically, cAMP signalling seems to weaken persistent firing 
and impair working memory by increasing the open state of HCN 
(hyperpolarization-activated cyclic nucleotide-gated) channels that 
are localized on spines where networks interconnect*. Recent data 
indicate that HCN channels may also gate synaptic inputs through 
interactions with KCNQ channels, whose open state is increased by 
cAMP-activating protein kinase A (PKA)’’. Studies indicate that 
cAMP signalling is disinhibited in the aged PFC”. Noradrenergic 
O% receptor inhibition of cAMP may be reduced from loss of a, 
receptors in the aged PFC”, and decreased excitation of noradrenergic 
neurons”. 

There have been few electrophysiological recordings from aged PFC 
neurons owing to the demanding nature of this procedure. Recordings 
from rat orbital PFC found reduced flexibility in aged neurons”. 
However, there have been no in vivo recordings from the aged dorso- 
lateral PFC, even though behavioural data indicate that this region is 
particularly vulnerable to normal ageing. In vitro recordings from 
dorsolateral PFC neurons found relatively subtle changes in excitabil- 
ity with advancing age’®, but their consequences for executive function 
must be observed in a cognitively engaged circuit. Here we perform the 
first physiological characterization of PFC neuronal response during a 
working memory task in young adult, middle-aged and aged monkeys. 

Monkeys (Macaca mulatta, n = 6) were trained to perform a spatial 
working memory task in which they have to remember a spatial loca- 
tion over a brief delay period; the spatial location changes randomly on 
each trial (Fig. 1a). Two animals were young adults (7- and 9-year-old 
males), two were middle-aged (12- and 13-year-old males), and two 
were aged (17-year-old male, 21-year-old female). Short delays (2.5 s) 
were used in all age groups to ensure similar performances (>85% per 
cent correct) across age groups. Neurons (n = 301) were recorded 
from area 46, the dorsolateral PFC subregion most needed for visuos- 
patial working memory (Fig. 1a). Neurons were characterized based on 
task-related firing as responsive during (1) the visuospatial cue period, 
(2) the delay period when the spatial position was being remembered, 
and/or (3) the motor response period. Some neurons fired only during 
cue presentation (CUE cells, m = 28), whereas most neurons fired 
during the delay period as well as to the cue and/or response periods 
(DELAY cells, n = 273). Persistent firing during the delay period is of 
particular interest, as it is required for working memory*®. Many PFC 
DELAY neurons increased their activity during the memory of one 
spatial location (its preferred direction), but not other locations (the 
‘anti-preferred’ direction, 180° away from the preferred direction; 
Fig. 1d). 
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Figure 1 | Age-related changes in the PFC networks that subserve working 
memory. a, The region of the dorsolateral PFC most needed for spatial 
working memory, and the site of recordings. AS, arcuate sulcus; PS, principal 
sulcus. b, The oculomotor delayed response (ODR) spatial working memory 
task. The monkey fixates on the central stimulus and maintains fixation for the 
duration of the trial. A cue is briefly presented in one of eight locations, followed 
by a delay period (2.5 s) in which no spatial information is present. At the end of 
the delay period, the fixation spot disappears, and the monkey makes an eye 
movement (saccade) to the remembered location for a juice reward. The cue 
position randomly changes on subsequent trials. c, A diagram of the recurrent 
excitatory networks subserving working memory. Pyramidal cells with similar 
spatial tuning excite each other to maintain persistent firing across the delay 
period®. These networks are concentrated in deep layer III°. Spatial tuning is 
enhanced by GABAergic lateral inhibition (not shown). d, An example of a 
dorsolateral PRC DELAY neuron with spatially tuned, persistent firing during 
the delay period. This neuron shows increased firing for the cue, delay and 
response for the neuron’s preferred direction (highlighted in blue), but not for 
nonpreferred directions (white backgrounds). The anti-preferred direction 
opposite to the neuron’s preferred direction is shown in red; note that 
subsequent figures show only the preferred and anti-preferred directions for the 
sake of brevity. e, Pyramidal cells synapse on spines where cAMP-PKA 
signalling regulates the open state of HCN and KCNQ channels, and thus 
modulates the strength of network connections’. f, Population average activity 
for the dorsolateral PRC DELAY neurons recorded in each age group (102, 101 
and 70 neurons for young, middle-aged and old monkeys, respectively). 
Colours indicate the activity during the trials in which the cue was presented in 
the neuron’s preferred (blue) and anti-preferred (red) directions; the darker 
grey background refers to the cue period; the lighter grey background to the 
delay period. Error envelope represents s.e.m. 


The firing of DELAY cells was markedly reduced with advancing 
age (Figs 1f and 2). Figure 1f portrays the differences in firing rates 
across the population of DELAY neurons in young, middle-aged and 
aged animals (for individual examples of DELAY neurons in young, 
middle-aged and aged monkeys, see Supplementary Fig. 1). There was 
a significant decline in the spontaneous firing rate of DELAY cells, as 
well as a marked decline in task-related firing. Figure 2 shows a steep 
decline in the firing rates of DELAY cells across the age span (t-test on 
age variable in regression analysis, P< 10 * for all epochs), with older 
animals showing a restricted range of lower firing rates (Supplemen- 
tary Fig. 2). This age-related activity decline persisted throughout the 
2.5-s delay period (without main effect of epoch (0.5 s) or age X epoch 
interaction in repeated measures ANOVA, P > 0.25). Additional con- 
trol analyses showed that age-related decline in the firing rate of 
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Figure 2 | Age-dependent decline in the spatially tuned, persistent firing of 
dorsolateral PFC DELAY neurons. a, Marked reduction of dorsolateral PFC 
DELAY activity for the neurons’ preferred direction with advancing age. 
Activity of individual neurons of each animal was averaged separately for the 
last 0.5 s during the intertrial interval (pre-fix, grey), the fixation period (fix, 
black), the cue period (cue, red), and the delay period for the neuron’s preferred 
direction. Firing during the delay period is represented in a successive series of 
0.5s intervals (colour coded from yellow to blue). Lines were obtained using 
linear regression. b, Firing rates during the delay period for the anti-preferred 
direction of the same neurons shown in a. There was a significant age-related 
decline in all epochs, but it was less prominent than the decline in firing for the 
preferred direction during the delay period. Colour coding for each 0.5 s 
interval is as in a. c, Age-related decline in spatial tuning, whereby the difference 
between firing for the preferred versus anti-preferred directions during the 
delay period declines with advancing age. Colour coding is as in a. d, Age- 
related decline in d’, that is, the ability to distinguish preferred from anti- 
preferred spatial directions based on firing rate patterns during the entire delay 
period. Error bars represent s.e.m. 


DELAY cells is not due to a sampling bias during the recording experi- 
ment (see Supplementary Information). The age-related decline in 
firing rate was particularly prominent during the cue and delay periods 
for the neuron’s preferred direction (Fig. 2a); the decline in firing for 
the anti-preferred direction (Fig. 2b) or before target onset (Fig. 2a) 
was less pronounced. Consequently, the difference in delay-related 
firing for the neuron’s preferred direction versus its anti-preferred 
direction eroded with increasing age (t-test, P< 10 °, for cue period 
and every 0.5s epoch in the delay period; Fig. 2c), largely due to 
reduced firing for the neuron’s preferred direction (Fig. 2a). This led 
to a reduction in d’ with advancing age (t-test on age versus d’ cor- 
relation coefficient, P< 0.0001), that is, a reduced ability to distinguish 
the preferred from anti-preferred directions during the delay period 
when spatial information was held in working memory (Fig. 2d). 
These results are consistent with studies showing impairment in spatial 
working memory in aged monkeys at relatively short (for example, 5 s) 
delays'*, and single-unit data as well as neural circuit modelling indi- 
cate that inadequate PFC recurrent network firing underlies the deficits 
in PFC cognitive function observed in ageing monkeys and humans 
(Supplementary Figs 3, 4 and Supplementary Information). 

In contrast to DELAY neurons, which showed prominent decline in 
firing with advancing age, there were no age-related changes in the 
firing rates of PFC CUE cells that responded specifically to the spatial 
cue (Fig. 3). The average firing rate of these neurons for the preferred 
direction during the cue period was 26.7 + 4.4spikess * in young 
monkeys (7 = 12 neurons), and 25.3 + 3.7 spikess_' in old monkeys 
(11 neurons; t-test, P > 0.8; significant age < cell-type interaction in 
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Figure 3 | Firing rates of dorsolateral PFC CUE cells remain stable in aged 
monkeys. The average firing rates of CUE cells in young monkeys (left graph; 
10 neurons from a 7-year-old and 2 neurons from a 9-year-old monkey), did 
not differ from the firing rate in the oldest monkey (right graph; 11 neurons 
from a 21-year-old monkey), or from the firing rate averaged for both middle- 
aged (5 neurons from 13-year-old monkey, not shown separately) and old 
monkeys (t-test, P > 0.7). CUE cells may receive direct, ‘bottom-up’ excitation 
from parietal association cortex®, which may be less vulnerable to subtle 
molecular changes with advancing age. A subset of CUE cells recorded in the 
aged monkey showed high firing rates during the response period. Error 
envelope represents s.e.m. 


two-way ANOVA, P< 0.05). These data indicate that reductions in 
memory-related firing rate do not arise from generalized changes with 
advancing age affecting all neurons but, rather, are especially evident in 
recurrent circuits that must maintain firing in the absence of “bottom- 
up’ sensory stimulation. 

It is important to identify which changes in the ageing brain con- 
tribute to reduced firing during the delay period. There are many brain 
alterations associated with advancing age”, including decreased PFC 
grey matter volume”, focal changes in white matter” and dendritic 
spine loss*®, all of which correlate with cognitive decline. Importantly, 
spine loss is especially prominent in layer I1]—the layer where the 
recurrent excitatory networks reside—and thin-type spines are the 
most vulnerable in the aged PFC’. Immunoelectron microscopy indi- 
cates that thin spines have the greatest concentration of cAMP-HCN- 
channel signalling proteins, indicating that disinhibition of cAMP 


signalling with advancing age may weaken thin spines in particular’. 
Thus, we tested whether inhibition of cAMP signalling in the PFC 
could partially restore the working-memory-related firing of aged 
neurons, or whether reductions in firing were irreversible owing to 
immutable architectural changes in the aged brain. Drugs were applied 
near the recorded neurons using iontophoresis, whereby a small 
electrical current is applied to extrude charged molecules from glass 
pipettes attached to the recording electrode. Only a minute amount of 
drug was released, sufficient to alter the firing of nearby neurons, 
without altering behavioural performance. 

Agents that inhibit cAMP signalling or block HCN or KCNQ 
channels restored persistent firing during the delay period of the 
working memory task (Fig. 4). For example, iontophoresis of the 
A agonist, guanfacine (Fig. 4a, e), or the cAMP-PKA inhibitor, 
Rp-cAMPS (Fig. 4b, f), significantly increased firing during the delay 
period on trials when the cue had appeared at the neuron’s preferred 
direction. In contrast, the PDE4 inhibitor, etazolate—which increases 
cAMP signalling—further decreased neuronal firing in aged neurons 
(P < 0.001; Supplementary Fig. 5). We also tested whether blockade of 
HCN or KCNQ channels could restore firing, given that cAMP-PKA 
signalling increases the open state of these ion channels. As shown in 
Fig. 4c and g, a low dose of the HCN channel blocker ZD7288 signifi- 
cantly enhanced the delay-related firing rates of neurons in aged 
monkeys. KCNQ channels were also of interest, as in vitro physio- 
logical characterizations of PFC neurons in aged primates have found 
increases in the slow afterhyperpolarization, which is mediated in part 
by KCNQ channels”®. As shown in Fig. 4d and h, blockade of KCNQ 
channels with XE991 increased delay-related firing in aged PFC neu- 
rons. Thus, agents that reduced cAMP opening of HCN or KCNQ 
channels all restored firing rates to levels resembling those observed in 
younger monkeys. These findings are consistent with behavioural data 
showing that guanfacine and Rp-cAMPS can enhance working memory 
performance in aged animals when administered systemically (guanfa- 
cine) or directly into the rat PFC (guanfacine or Rp-cAMPS)”’ (see 
Supplementary Information). On the basis of these data, guanfacine 
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Figure 4 | Iontophoresis of compounds that inhibit cAMP-PKA signalling, 
or block HCN or KCNQ channel signalling, strengthens delay-related firing 
in aged PFC DELAY neurons. a-d, A summary of the results showing a 
significant increase in population-average firing rate for the neuron’s preferred 
direction compared to control conditions (paired t-test, P< 0.01 for a, c and 
d, and P< 0.05 for b) following iontophoresis of the «4 adrenergic agonist 
guanfacine applied at 10 nA (GFC; a, significant effects in 7 out of 9 neurons, 
t-test, P<0.05, indicated by solid lines); the cAMP inhibitor Rp-cAMPS at 
50 nA (b, significant in 4 out of 6 neurons); the HCN channel blocker, ZD7288 
at 15nA (¢, significant in 4 out of 7 neurons); and the KCNQ channel blocker 
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XE991 at 15nA (d, significant in 3 out of 6 neurons). In all cases, significant 
effects were found more frequently than expected by chance (binomial test, 
P<0.005). The orange lines represent the individual neurons shown in 

e-h. e-h, Individual examples of neurons under control conditions (top) firing 
to their preferred (blue trace) or anti-preferred (red trace) directions, compared 
to their firing patterns following iontophoresis of guanfacine (e), Rp-cAMPS 
(f), ZD7288 (g) or XE991 (h). The orange lines in a-d indicate the individual 
neurons shown in e-h. Error bars are s.e.m. *P < 0.05, **P < 0.01 significant 
difference between drug versus control for the neuron’s preferred direction. 
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is currently being tested in elderly humans with PFC cognitive deficits 
(http://www.clinicaltrials.gov: trial number NCT00935493). 

The current study revealed a physiological basis for age-related 
working memory decline in the primate brain, with a reduction of 
memory-related firing beginning in middle age and worsening with 
advancing age. This marked change in network physiology may render 
higher cortical circuits especially vulnerable to neurodegenerative 
processes such as Alzheimer’s disease. However, these studies also 
uncovered more hopeful data showing that restitution of the proper 
neurochemical environment can partially restore physiological 
integrity. These data establish that cognitive changes with advancing 
age are malleable, and that there is potential to restore at least some 
cognitive abilities in the elderly. Maintaining strong PFC physiology 
into advanced age will be an important advantage in an increasingly 
complex, ageing society. 


METHODS SUMMARY 


All experiments were performed in accordance with National Institutes for Health 
guidelines for animal research and were approved by the Yale Institutional Animal 
Care and Use Committee. 

Behavioural training. Monkeys were trained on the oculomotor delayed response 
(ODR) task (Fig. 1b) as reported elsewhere®, with special care to minimize all 
stress. The aged monkeys took longer to learn the task, rested more frequently 
than young monkeys during the testing sessions, and performed fewer trials each 
day than younger animals. A brief delay period (2.5 s) was chosen to ensure that all 
monkeys in the study performed above 85% correct during training, and could 
maintain high levels of performance during the study (Supplementary Fig. 6). 
Physiology and iontophoresis. Single unit recordings were made from the 
dorsolateral PFC surrounding the caudal portion of the principal sulcus 
(Fig. la). The recording procedures and iontophoresis methods have been 
described previously*. Single neuron activity was readily isolated (Supplemen- 
tary Fig. 7). Denoting the average activity during the 0.5s cue period and that 
during the 2.5 s delay period during the trials in preferred directions as C and D, 
neurons were classified as CUE cells, when the ratio D/C was >0.5, and as DELAY 
cells otherwise. Statistical analyses, including ANOVA and regression analyses, 
were performed using Matlab (MathWorks). The effect of age on firing rate during 
a series of 0.5s epochs was tested using a regression model, and its statistical 
significance with a t-test. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Oculomotor delayed response task. Studies were performed on four adult male 
rhesus monkeys (Macaca mulatta) trained on the spatial oculomotor delayed 
response (ODR) task as previously described*. This task requires the monkey to 
make a memory-guided saccade to a remembered visuospatial target. Each trial 
began when the subject fixated at the central spot for 0.5s (fixation period). 
Subsequently, a cue was illuminated for 0.5s at one of eight peripheral targets 
(cue period). After the cue was extinguished, a 2.5 s delay period followed. During 
the cue and delay periods, the monkey was required to maintain central fixation. 
At the end of the delay, the fixation spot was extinguished, instructing the monkey 
to make a memory-guided saccade to the previously cued location (saccade period). 
The monkey was rewarded with fruit juice immediately after every successful 
response. The position of the stimulus was randomized over trials such that it 
had to be remembered on a trial-by-trial basis. The subject’s eye position was 
monitored with the ISCAN Eye Movement Monitoring System, and the ODR task 
was generated by the TEMPO real-time system (Reflective Computing). The aged 
monkeys took longer to learn the task, rested more frequently than young monkeys 
during the testing sessions, and performed fewer trials each day than younger 
animals. A brief delay period (2.5) was chosen to ensure that all monkeys in the 
study performed above 85% correct during training, and could maintain high levels 
of performance during the study (Supplementary Fig. 6). 

Recording locus. Before recording, the animal underwent a magnetic resonance 
image (MRI) scan to obtain exact anatomical coordinates of brain structures, 
which guided the placement of the chronic recording chambers. MRI-compatible 
materials were used for the implant so that the position of the recording chambers 
could be confirmed by MRI after implantation. The site of recordings in the 
present study were located in an area ranging from 0-5 mm anterior to the caudal 
end of the principal sulcus and —2-2 mm medial to the principal sulcus. 
Pharmacology, physiology and data acquisition. Guanfacine, XE991 and 
ZD7288 (Tocris) and etazolate (Sigma) were dissolved at 0.01 M in triple-distilled 
water (adjusted with HCl to pH 3.5-4.0). Rp-cAMP (Sigma) was dissolved at 
0.01 M in triple-distilled water (adjusted with NaOH to pH 9). 

Iontophoretic electrodes were constructed with a 20-jtm pitch carbon fibre 
(ELSI) inserted in the central barrel of a seven-barrel non-filamented capillary 
glass (Friedrich and Dimmock). The assembly was pulled using a multipipette 
electrode puller (MicroData Instrument) and the tip was bevelled to obtain the 
finished electrode. Finished electrodes had impedances of 0.3-1.0 MQ (at 1 kHz) 
and tip sizes of 30-40 tum. The outer barrels of the electrode were then filled with 3 
drug solutions (two consecutive barrels each) and the solutions were pushed to the 


tip of the electrode using compressed air. A Neurophore BH2 iontophoretic sys- 
tem (Medical Systems) was used to control of the delivery of the drugs. The drug 
was ejected at currents that varied from 5-50 nA. Retaining currents of —3 to 
—5nA were used in a cyclical manner (1s on, 1s off) when not applying drugs. 
Drug ejection did not create noise in the recording, and there was no systematic 
change in either spike amplitude or time course at any ejection current. 

The electrode was mounted on a MO-95 micromanipulator (Narishige) in a 25- 
gauge stainless steel guide tube. The dura was punctured using the guide tube to 
facilitate access of the electrode to cortex. Extracellular voltage was amplified using 
an AC/DC differential amplifier (A-M systems; Model 3000) and band-pass filtered 
(180 Hz-6 Khz, 20 dB gain, 4-pole Butterworth; Kron-Hite). Signals were digitized 
(20.83 kHz, micro 1401, Cambridge Electronics Design) and acquired using the 
Spike2 software (CED). Neural activity was analysed using waveform sorting by a 
template-matching algorithm, which made it possible to isolate more than one unit 
at the same recording site. Post-stimulus time histograms and rastergrams were 
constructed online to determine the relationship of unit activity to the task. Unit 
activity was measured in spikes per second. Ifthe rastergrams displayed task-related 
activity, the units were recorded further and pharmacological testing was performed. 

Data were first collected from the cell under a control condition in which at least 

eight trials at each of eight cue locations were obtained. On establishing the 
stability of the cells’ activity, this control condition was followed by the drug 
application. Dose-dependent effects of the drug were tested in two or more con- 
secutive conditions. Drugs were continuously applied at a relevant current 
throughout a given condition. Each condition had ~8 (typically 10 or more) trials 
at each location for statistical analyses of effects. 
Data analysis. For purposes of data analysis, each trial in the ODR task was 
divided into four epochs: fixation, cue, delay and response (saccade). The fixation 
epoch lasted for 0.5s. The cue epoch lasted for 0.5s and corresponds to the 
stimulus presentation phase of the task. Delay lasted for 2.5s and reflects the 
mnemonic component of the task. The response phase started immediately after 
the delay epoch and lasted ~1.5s. 

Two-way analysis of variance, ANOVA, was used to examine the spatial tuned 
task-related activity with regard to (1) different periods of the task (cue, delay, 
response versus fixation); and (2) different cue locations. One-way ANOVAs were 
used to assess the effect of the drug application on cells showing delay-related 
activity. Statistical analyses, including ANOVA and regression analyses were per- 
formed using Matlab (MathWorks). The effect of age on firing rate during a series 
of 0.5 s epochs was tested using a regression model, and its statistical significance 
with a f-test. 
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Genetic risk and a primary role for cell-mediated 
immune mechanisms in multiple sclerosis 


The International Multiple Sclerosis Genetics Consortium* & the Wellcome Trust Case Control Consortium 2* 


Multiple sclerosis is a common disease of the central nervous 
system in which the interplay between inflammatory and neuro- 
degenerative processes typically results in intermittent neuro- 
logical disturbance followed by progressive accumulation of 
disability’. Epidemiological studies have shown that genetic factors 
are primarily responsible for the substantially increased frequency 
of the disease seen in the relatives of affected individuals’, and 
systematic attempts to identify linkage in multiplex families have 
confirmed that variation within the major histocompatibility 
complex (MHC) exerts the greatest individual effect on risk’. 
Modestly powered genome-wide association studies (GWAS)*° 
have enabled more than 20 additional risk loci to be identified 
and have shown that multiple variants exerting modest individual 
effects have a key role in disease susceptibility''. Most of the genetic 
architecture underlying susceptibility to the disease remains to be 
defined and is anticipated to require the analysis of sample sizes 
that are beyond the numbers currently available to individual 
research groups. In a collaborative GWAS involving 9,772 cases 
of European descent collected by 23 research groups working in 15 
different countries, we have replicated almost all of the previously 
suggested associations and identified at least a further 29 novel 
susceptibility loci. Within the MHC we have refined the identity 
of the HLA-DRB1 risk alleles and confirmed that variation in the 
HLA-A gene underlies the independent protective effect attri- 
butable to the class I region. Immunologically relevant genes are 
significantly overrepresented among those mapping close to the 
identified loci and particularly implicate T-helper-cell differenti- 
ation in the pathogenesis of multiple sclerosis. 

We performed a large GWAS as part of the Wellcome Trust Case 
Control Consortium 2 (WTCCC2) project. Cases were recruited 
through the International Multiple Sclerosis Genetics Consortium 
(IMSGC) and compared with the WT'CCC2 common control set'”!* 
supplemented by data from the control arms of existing GWAS. We 
introduced a number of novel quality control methods for processing 
these data sets (see Supplementary Information), which ultimately 
provided reliable information from 9,772 cases and 17,376 controls 
(Fig. 1a). After single nucleotide polymorphism (SNP)-based quality 
controls, data from 465,434 autosomal SNPs, common to all internally 
and externally generated data sets, were available for analysis. 

The multi-population nature of our study (Fig. la, b) afforded an 
opportunity to assess various published approaches for controlling the 
potential confounding effects of population structure, several of which 
(in the event) proved unhelpful (see Supplementary Information). 
Although not common in primary GWAS undertaken to date, the 
challenge of combining data across populations, in contexts where 
not all case samples have controls available from the same population 
(thus precluding standard meta-analytical techniques), may become 
more routine as study sizes increase. 

We attempted analyses of the non-United Kingdom (UK) data with 
the now widespread technique of using principal components as 
covariates to correct for structure. However, even use of all seven top 
principal components that captured genome-wide effects in our data 


resulted in an unacceptably high genomic inflation: for example, the 
genomic control factor’* (2) was A = 1.2. We tried to reduce the genomic 
inflation by discarding the case samples that seemed least well matched to 
control sets. Removal of half the available cases in this fashion only 
reduced / to 1.1. In another approach to handling structure, statistical 
clustering algorithms were successful in identifying subgroups of the data 
within which cases and controls seemed well matched for ancestry (see 
Supplementary Fig. 17). However, tests within these subgroups com- 
bined via fixed-effects meta-analysis also yielded unacceptably high 
genomic inflation (A >1.4) in an analysis with seven matched subgroups 
of cases and controls. Lastly, we applied a novel variance components 
method (similar to one described previously’*), separately to the UK and 
non-UK data sets, which explicitly accounts for correlations among the 
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Figure 1 | Distribution of cases and controls. a, b, All cases and controls were 
drawn from populations with European ancestry; cases from 15 countries and 
controls from 8.a, Numbers of case (red) and control (black) samples from each 
country. b, The projection of samples onto the first two principal components 
of genetic variation, with cases shown on the left and controls on the right. The 
axes are orientated to approximate the geography, and samples are colour 
coded as indicated in the legend. NZ, New Zealand. We genotyped the cases 
(9,772) and some Swedish controls (527) using the lumina Human 660-Quad 
platform, and the UK controls (5,175, the WT'CCC2 common control set'*"*) 
using the Illumina 1.2M platform. All other controls were genotyped externally 
using various Illumina genotyping systems (see Supplementary Information). 


*A list of authors and their affiliations appears at the end of the paper; membership of both consortia is listed in Supplementary Information. 
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phenotypes of individuals resulting from relatedness, allowing us to deal 
successfully with all sources of structure in our samples (see Sup- 
plementary Information for details of the linear mixed model we used). 
For example, the genomic inflation was reduced to 2 = 0.995 in the UK 
and 1.016 in the non-UK data (see also Supplementary Information). 
After fixed-effects meta-analysis of the results from the UK and non-UK 
data sets, the inflation factor was 2 = 1.045. We adopted this approach 
for all subsequent non-MHC association analyses. 

Outside the MHC we identified 95 distinct regions having at least 
one SNP associated with multiple sclerosis at Pewas < 1 X 10 *°; in 
six of these 95 regions conditional analysis revealed an additional SNP 
showing association to the same locus (one locus containing two such 
SNPs). In total we took all 102 SNPs forward to replication, which we 
performed using data from previously reported multiple sclerosis 
GWAS*” and the iControl database (excluding any WTCCC controls 
previously used in these studies). In total, the replication analysis 
included data from 4,218 cases and 7,296 controls. These were con- 
sidered in six independent strata after which results were combined 
through a fixed-effects meta-analysis. For 98 of the 102 SNPs, the same 
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allele was overrepresented in cases compared to controls. Twenty three 
of the 26 previously known or strongly suggested multiple-sclerosis- 
associated loci were replicated in our primary GWAS with 
Pgwas <1 107°. Our GWAS and replication also revealed another 
29 novel associated regions (defined as having Pewas < 1X 104°, 
one-sided Preptication < 0.05 and Prombinea < 5 X 10-8), and a further 5 
regions with strong evidence for association (with Pewas < 1 x 10 *°, 
one-sided Preptication < 0.05 and Pcombined < 5% 10 7). In one previ- 
ously reported locus and two novel loci, additional SNPs were identified 
as being conditionally important in explaining risk. Just over one third 
of the identified loci overlap with regions already confirmed as asso- 
ciated with at least one other autoimmune disease (according to the 
GWAS catalogue, http://www.genome.gov/gwastudies/). Results for 
both the previously established and novel loci are shown in Fig. 2 and 
Supplementary Tables 1-3; and details of all 102 SNPs taken to replica- 
tion are available in Supplementary Data. 

To assess objectively the collective evidence across the associated 
regions for particular classes of genes, we performed statistical analyses 
to look for enrichment of genes with similar function. We first identified 
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the nearest gene to the lead SNP in each of the (52) regions of asso- 
ciation and used the Gene Ontology (GO) database’* to define sets of 
functionally related genes (GO terms). We then tested whether the set 
of nearest genes was enriched for particular GO terms using Fisher’s 
exact test. The GO terms having the most significant enrichment 
include genes linked to lymphocyte function (P = 3.2 X 10 '’, odds 
ratio (OR) = 35.96) and in particular those with a role in T-cell activa- 
tion and proliferation (P = 1.85 X 10° °, OR = 40.85). These are rep- 
resentative of a larger group associated with various components of the 
GO ‘immune system process’ (P = 8.6 X 10 1’, OR = 9.12). A similar 
analysis based on all genes in or near association regions showed 
similar enrichment, as did independent analyses based on nearest gene 
or all genes in our next tier of signals, the 42 regions taken to replication 
but not meeting the thresholds above for association (see Supplemen- 
tary Data). Although GO immune system genes only account for 7% of 
human genes, in 30% of our association regions the nearest gene to the 
lead SNP is an immune system gene. As an illustration, Fig. 3 shows a 
schematic of genes involved in the T-helper-cell differentiation path- 
way; a notable number show strong evidence for association with 
multiple sclerosis, particularly those acting as cell surface receptors. 
We infer from this pathway analysis of our GWAS signals that specific 
classes of immune system genes are especially important in the patho- 
genesis of multiple sclerosis. 

Our screen not only implicates a multitude of genes coding for 
cytokine pathway (CXCR5, IL2RA, IL7R, IL7, IL12RB1, IL22RA2, 
IL12A, IL12B, IRF8, TNFRSF1A, TNFRSF14, TNFSF 14), co-stimulatory 
(CD37, CD40, CD58, CD80, CD86, CLECL1) and signal transduction 
(CBLB, GPR65, MALT1, RGS1, STAT3, TAGAP, TYK2) molecules of 
immunological relevance, but also relates to previously reported 
environmental risk factors such as vitamin D®”” (CYP27B1, CYP24A1) 
and therapies for multiple sclerosis including natalizumab'* (VCAM1) 
and daclizumab”’ (IL2RA). There is a relative absence of genes relevant 
to potential pathways for neurodegeneration independent of inflam- 
mation (GALC, KIF21B). 

To refine our understanding of the MHC associations in multiple 
sclerosis we imputed classical human leukocyte antigen (HLA) types at 
six loci (A, B, C, DQA1, DQB1 and DRB1)” and analysed these along- 
side the SNPs (see Supplementary Information for validation; at alleles 
responsible for the major signals described later, estimated specificity 
was at least 0.99 and sensitivity was at least 0.98, except for DRB1*13:03, 
where it was 0.88). Primary discovery was focused on the UK cohort 
with candidate signals being validated through support from addi- 
tional case-control cohorts. Because of the extensive linkage disequi- 
librium within the MHC, we identified associated alleles in a stepwise 
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manner, selecting the most strongly associated to include in a general 
model, in turn, if Pux <10 4 and Peombinea < 10°” (Supplementary 
Information). At each stage we explored possible interactions and 
departures from the simple model in which risk increases multiplica- 
tively with each additional copy of the relevant allele (additive increase 
on the log-odds scale) within the logistic risk framework. 

Using this approach we found that DRB1*15:01 has the strongest 
association with multiple sclerosis among all classical and SNP alleles, 
with a consistent effect between cohorts (P< 1X 10° °°; Fig. 4a). The 
data are consistent with an additive effect on the log-odds scale for each 
additional allele. Conditioning on DRB1*15:01, we confirmed the 
presence of a protective class I allele and identified the signal as being 
driven by HLA-A*02:01 (as previously suggested*’), with a consistent 
effect size across cohorts (P= 9.1 X 10 *°; Fig. 4a). Again, we found no 
strong evidence for departure from additivity on the log-odds scale or 
statistical interaction with DRB1*15:01. Conditioning on both 
DRB1*15:01 and A*02:01 revealed additional risk associated with the 
strongly linked alleles DRB1*03:01 and DQB1*02:01 (P = 3.6 X 10 "°; 
Fig. 4a; note that we cannot separate these alleles but for simplicity 
refer only to DRB1*03:01 later). Further conditioning identified an 
additional DRBI risk allele DRB1*13:03 (P= 1.310 ''; Fig. 4a). 
Although no other classical alleles meet the above criteria, we did 
observe several SNPs providing independent signals, the strongest com- 
ing from rs9277535_G (combined OR 1.28, P = 2.2 X 10 77), an allele 
known to be in linkage disequilibrium with DPB1*03:01 (1? = 0.37)”. 

Analysis of the MHC SNP data using a genealogical method 
(GENECLUSTER)” offers an alternative means of relating our results 
to classical HLA alleles that provides additional insight into the under- 
lying genetic architecture (see Supplementary Information). Figure 4b 
shows genealogical trees relating the classical alleles at DRB1 and HLA-A, 
together with the estimated evolutionary position of the mutations pre- 
dicted by GENECLUSTER, as most completely modelling the asso- 
ciation. At HLA-DRBI1, three mutations are predicted, each of which 
implicates a clade of haplotypes carrying particular DRB1 alleles. All of 
the DRB1 alleles we have shown to be independently associated are 
included in these clades, each corresponding to a particular mutation. 
In addition, the analysis also explains why those haplotypes carrying the 
*(8:01 allele have previously been shown to increase risk**” as they carry 
the same mutation as those bearing *13:03. At HLA-A, the predicted 
protective mutation is also concordant with our regression analysis of 
classical alleles in implicating *02:01 but, in addition, predicts that 
*68:01, *02:05 and *02:06 carry the same protective allele. All of these 
secondary predictions (increased risk from DRB1*08:01 and protection 
from HLA-A*68:01, *02:05 and *02:06) are supported in our regression 


T,17 cell Figure 3 | Graphic representation of the 
T-helper-cell differentiation pathway. The figure 
is derived from an image generated by Ingenuity 
Pathway Analysis (IPA) software version 8.8 
(Ingenuity Systems). Alphanumeric labels indicate 
the individual genes and gene complexes (nodes) 
included in the pathway (note that some are 
included more than once). Coloured nodes are 
those containing a gene implicated by proximity to 
an SNP showing evidence of association. Red, in 
bold or grey in Fig. 2 (plus MHC class II region and 
TNF); orange, other loci in Fig. 2 or discovery P 
value < 1X 10 *° and consistent replication data; 
yellow, discovery P value< 1X 10 *. Other 
molecules (proteins, vitamins etc) may also be of 
relevance in these processes but are not included 
here as they are not currently listed as being part of 
this particular pathway in the IPA database. 
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Figure 4 | Results for the main MHC alleles. a, Forest plots for each of the 
primary HLA alleles (HLA-A*02:01, DRB1*15:01, DRB1*03:01 and 
DRB1*13:03) showing consistency of effect across the populations and 
combined OR of 0.73, 3.1, 1.26 and 2.4, respectively (whiskers indicate 95% 
confidence intervals). b, The genealogical trees estimated for DRBI (top) and 
HLA-A (bottom). These trees were constructed using classical HLA and SNP 
typing data available from the HapMap CEU haplotype data. Each left-hand 
branch of the tree terminates on a set of haplotypes carrying a particular HLA 
allele. The coloured dots indicate the mostly likely locations for a disease- 
associated mutation as predicted by the GENECLUSTER program”. In the 


analysis of classical alleles but the power to detect them in the primary 
analyses is limited because each allele occurs at very low frequency. 

We found no evidence for genetic associations with clinical course, 
severity of disease or month of birth, and no evidence of interaction 
with gender or DRB1*15:01 in any part of the genome (see Sup- 
plementary Information). However, analysis with respect to age at 
onset replicated the previously suggested association with the 
DRB1*15:01 allele’®. Although no other part of the genome contained 
individual SNPs showing strong evidence for association, risk alleles 
determining susceptibility are collectively more closely associated with 
age at onset than expected by chance, indicating that individual genetic 
susceptibility is inversely correlated with age at onset. 

Our GWAS-—large for any complex trait having a prevalence of 
1:1,000 and involving diverse populations of European descent—has 
identified 29 novel susceptibility loci. Four mutations, one from class I 
and three from class II, with effects modelled in a simple multiplicative 
manner within and across loci are sufficient to account for most of the 
risk attributable to the MHC (see Supplementary Information). 
Although our data do not address the issue of which components 
within the nervous system are initially damaged by the inflammatory 
response, the overrepresentation of genes that influence T-cell 
maturation provides independent and compelling evidence that the 
critical disease mechanisms primarily involve immune dysregulation. 

More generally, our study reinforces the view that the GWAS design, 
combined with very large experimental sample sizes and careful 
statistical analysis, provides valuable insights into the genetic architec- 
ture of common complex diseases. Here, this approach has identified 
many associated genetic variants close to genes, which are both indi- 
vidually interesting and collectively illuminate the roles of key bio- 
logical pathways. It also provides indirect evidence that many more 
common variants of small effect contribute to genetic susceptibility for 
multiple sclerosis. Simple models, in which the previously known and 
newly identified variants affect risk multiplicatively, both within and 
across loci, explain a meaningful proportion (~20%, see Supplemen- 
tary Information) of genetic risk for the disease. Important challenges 
lie ahead in understanding overlap between the genetic basis for 
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DRB1 tree, the blue dot captures a risk effect attributable to all haplotypes 
carrying the *15:01 allele. The green dot captures a risk effect carried by all 
haplotypes carrying the *03:01 allele and the red dot captures a risk effect on 
haplotypes carrying *13:03 or *08:01. In the HLA-A plot, the orange dot is a 
protective mutation lying at the root of all *02:01, *02:05, *02:06 and *68:01 
alleles. The blue dot in brackets denotes a branch containing those *03:01 
haplotypes that also carry DRB1*15:01; the GENECLUSTER prediction here is 
thus a reflection, due to linkage disequilibrium of the risk attributable to 
DRB1*15:01. The terminal branches are labelled with the allele carried by the 
haplotype and its frequency. 


susceptibility in the context of different autoimmune diseases, and in 
uncovering the functional mechanisms underlying these associations. 


METHODS SUMMARY 


Details of case ascertainment, processing and genotyping, together with sample 
and genotyping quality control are provided in Supplementary Information. 
Statistical methods developed for testing the reliability of externally generated data 
sets, detecting samples with non-European ancestry, correcting for structure, 
classical HLA imputation and meta-analysis are also outlined in Supplementary 
Information. Results for all scans and all reported loci are described in detail in 
Supplementary Information. 
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Induction of human neuronal cells by defined 


transcription factors 


Zhiping P. Pang'*, Nan Yang**, Thomas Vierbuchen***, Austin Ostermeier”’, Daniel R. Fuentes’, Troy Q. Yang’, Ami Citri*, 


Vittorio Sebastiano’, Samuele Marro*, Thomas C. Stidhof!* 


Somatic cell nuclear transfer, cell fusion, or expression of lineage- 
specific factors have been shown to induce cell-fate changes in 
diverse somatic cell types’. We recently observed that forced 
expression of a combination of three transcription factors, Brn2 
(also known as Pou3f2), Ascl1 and Mytll, can efficiently convert 
mouse fibroblasts into functional induced neuronal (iN) cells'*. 
Here we show that the same three factors can generate functional 
neurons from human pluripotent stem cells as early as 6 days after 
transgene activation. When combined with the basic helix-loop- 
helix transcription factor NeuroD1, these factors could also con- 
vert fetal and postnatal human fibroblasts into iN cells showing 
typical neuronal morphologies and expressing multiple neuronal 
markers, even after downregulation of the exogenous transcription 
factors. Importantly, the vast majority of human iN cells were able 
to generate action potentials and many matured to receive synaptic 
contacts when co-cultured with primary mouse cortical neurons. 
Our data demonstrate that non-neural human somatic cells, as well 
as pluripotent stem cells, can be converted directly into neurons by 
lineage-determining transcription factors. These methods may 
facilitate robust generation of patient-specific human neurons 
for in vitro disease modelling or future applications in regenerative 
medicine. 

Encouraged by our recent findings in mouse cells'’, we explored in 
this study whether human fibroblasts could also be directly induced to 
become functional neurons. This was unclear given the differences in 
the gene regulatory networks governing human and rodent neural 
development’*"'®. First, we sought to determine whether forced 
expression of transcription factors could induce a neuronal fate in 
human pluripotent cells. To that end, we infected undifferentiated 
human embryonic stem (ES) cells in chemically defined N3 media’’ 
with Brn2, Ascl1 and Myt1l (BAM) using doxycycline (dox)-inducible 
lentiviral vectors together with an EGFP (enhanced green fluorescent 
protein) virus. The majority of ES cells were EGFP-positive 24h after 
addition of doxycycline (Supplementary Fig. 1). Strikingly, as early as 
3 days after doxycycline treatment, we observed bipolar neuron-like 
cells surrounding nearly all ES cell colonies (Fig. 1a and Supplementary 
Fig. 1). By day 8, cells with more mature neuronal morphologies that 
expressed both B-III-tubulin (Tujl, also known as TUBB3) and MAP2 
had migrated away from ES cell colonies and were present throughout 
the plate (Fig. 1b, c). In contrast, after infection with EGFP virus alone, 
no neuronal cells were generated during the same timeframe, and 
nearly all ES cells had died due to the cytosine B-D-arabinofuranoside 
(Ara-C) treatment. We then determined the relative contribution of 
the three factors and found that Ascl1 alone was sufficient to induce 
MAP2-positive cells (Supplementary Fig. 2). The addition of Brn2 or 
Mytll or both did not increase the efficiency of neuronal differenti- 
ation but induced more complex morphologies. Cells infected with all 
three factors together had the most mature neuronal morphologies 
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(Supplementary Fig. 2). Electrophysiological analysis surprisingly 
revealed that as early as 6days after induction all recorded cells 
(n=16) generated action potentials (Fig. 1d). At day 15 after 
doxycycline treatment, the average resting membrane potential of 
neuronal cells was —51+1.8mV (mean+s.e.m., n= 18) (Fig. lf 
and Supplementary Table 1). These ES-iN cells exhibited prominent 
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Figure 1 | Rapid generation of functional neurons from human ES cells. 
a, Three days after induction, ES-iN cells showed bipolar neuronal 
morphologies. b, c, Eight days after induction, ES-iN cells expressed Tujl 

(b) and MAP2 (c). d, Spontaneous action potentials presumably caused by 
membrane potential fluctuations recorded from an ES-iN cell 6 days after 
induction. Arrow, pronounced after hyperpolarization potentials (AHP). 

e, Representative traces of action potentials in response to step current 
injections 15 days after induction. Membrane potential was maintained at 
approximately -63 mV. f, Quantification of intrinsic membrane properties in 
control ES cells (0 day) before and after viral transduction. Rin, membrane 
input resistance; RMP, resting membrane potential; Cm, capacitance. Scale 
bars, 10 pm (a, b, c). Numbers of cells recorded are labelled in the bars. Note the 
heterogeneity of the parameters (see also Supplementary Fig. 1). Data are 
presented with mean + s.e.m. ***P < 0.05. 
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after-hyperpolarization potentials (AHPs) following action potentials 
(Fig. 1d, f). Similar findings could be observed when human induced 
pluripotent stem (iPS) cells were infected (Supplementary Fig. 3). 
Thus, the BAM factors rapidly induce neuronal differentiation of 
human pluripotent stem cells. 

Next, we asked whether human fibroblasts could also be directly 
converted into neurons. To this end, we derived three independent 
primary human fetal fibroblast lines (HFFs) (see Methods) and per- 
formed an extensive characterization of these cultures in various 
growth conditions to confirm that they lack spontaneous neuronal 
differentiation potential and do not contain detectable amounts of 
neural crest stem cells (see Supplementary Fig. 4). Strikingly, 7-10 days 
after infection with the BAM factors we could detect cells with imma- 
ture neuronal morphologies. These cells expressed Tujl (Supplemen- 
tary Fig. 5a), but remained functionally immature as revealed by their 
inability to generate action potentials 20 days after doxycycline treat- 
ment (Supplementary Fig. 5b). Thus, the BAM factors seemed to 
induce neuronal features but were insufficient to generate functional 
neurons from human fetal fibroblasts under these conditions. 

Therefore, we screened 20 additional factors that could improve the 
generation of neuronal cells in combination with the BAM pool. We 
observed that NeuroD1, another basic helix-loop-helix transcription 
factor, improved the efficiency of generating Tuj1-positive neuronal cells 
two to threefold after 3 weeks (Fig. 2a). To determine the relative contri- 
bution of NeuroD 1, we tested various combinations of these four factors. 
NeuroD1 alone had no effect, but surprisingly in combination with Brn2 
it was sufficient to generate a similar number of Tujl-positive neuronal 
cells compared to the BAN (Brn2, Ascll and NeuroD1) , BMN (Brn2, 
Mytil and NeuroD1) and BAMN (Brn2, Ascl1l, Mytil and NeuroD1) 
pools (Supplementary Fig. 6a). However, further morphological and 
functional characterization showed that the BAMN combination 
generated the most mature neuronal cells (Supplementary Fig. 6b). We 
therefore decided to focus the further analysis on BAMN-iN cells. 

Two weeks after induction, BAMN-iN cells showed neuronal 
morphologies and were labelled with pan-neuronal antibodies such 
as anti-Tujl, anti-NeuN (also known as Rbfox3), anti-PSA-NCAM 
(polysialylated neural cell adhesion molecule) and anti-MAP2 
(Fig. 2b-f). After extended culture periods of 4-5 weeks, we could detect 
cells expressing neurofilaments (Supplementary Fig. 7a), and rare neur- 
onal processes decorated with punctate staining of synapsin and synap- 
totagmin, two synaptic vesicle proteins (Fig. 2g, h and Supplementary 
Fig. 7b). To ensure the co-expression of pan-neuronal and subtype 
specific markers, we performed single-cell gene-expression profiling 
of iN cells using Fluidigm dynamic RT-PCR arrays'*. We analysed 
54 single HFF-iN cells 34 days after doxycycline treatment from two 
independently infected cultures (Supplementary Fig. 8). These data 
revealed robust co-expression of multiple pan-neuronal and synaptic 
markers in 50/54 HFF-iN cells (f-II]-tubulin, DCX, MAP2, NCAM, 
synapsin). Over half (29 of 54) of the iN cells analysed expressed 
mRNAs typical for glutamatergic neurons, such as vGLUT1, vGLUT2 
(also known as SLC17A7 and SLC17A7, respectively) or both (Fig. 2i). 
Only two iN cells expressed GAD67 (also known as Gad1) in the 
absence of vGLUT] or 2, and no iN cell expressed the inhibitory marker 
gene vGAT (also known as SLC32A 1) . Five cells expressed the catecho- 
laminergic neuron marker tyrosine hydroxylase. Immunofluorescence 
analysis revealed that 5 weeks after infection 17 + 8% of iN cells 
expressed the forebrain marker Tbr1, 21 + 9% expressed the marker 
of peripheral neurons peripherin, whereas En1, a marker of midbrain 
neurons, serotonin and choline acetyltransferase were not detectable 
(Supplementary Fig. 9). 

To assess whether the iN cell state was stable without continued 
transgene expression, we monitored the mRNA expression levels of 
endogenous and exogenous BAMN genes before and after doxycycline 
induction and after doxycycline withdrawal. Whereas the exogenous 
transgenes were clearly doxycycline-dependent, the four correspond- 
ing endogenous genes were rapidly induced and exhibited increasing 
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Figure 2 | NeuroD1 increases reprogramming efficiency in primary human 
fetal fibroblasts. a, Quantification of Tujl-positive BAM-iN cells with 
indicated factors, 3 weeks after doxycycline treatment. b, c, Three weeks after 
doxycycline treatment BAM+NeuroD1 iN cells showed neuronal 
morphologies (b) and expressed Tujl (c) . d-f, iN cells expressed NeuN 

(d), PSA-NCAM (e), and MAP2 (f) 2 weeks after doxycycline treatment. 

g, h, AniN cell expressing MAP2 (g) and synapsin (h) 4 weeks after doxycycline 
treatment and co-cultured with primary astrocytes. i, Single-cell gene 
expression profiling using Fluidigm dynamic arrays. Rows represent the 
evaluated genes and columns represent individual cells. Heat map (blue to red) 
represents the threshold C, values as indicated. Data in (a) are presented as 
mean = s.d. Scale bars, 100 [tm (b, c) and 10 um (d-h). 


expression levels over time even after doxycycline withdrawal (Sup- 
plementary Fig. 10). Similarly, HFF-iN cells could be maintained in the 
absence of doxycycline for 3 weeks (Supplementary Fig. 11). 

We next asked whether iN cells generated from HFFs had active mem- 
brane properties. iN cells were identified by EGFP fluorescence (Fig. 3b) 
and whole-cell recordings were performed 14-35 days after doxycycline 
treatment. Two to three weeks after addition of doxycycline the average 
resting membrane potential of HFF-iN cells was —52.2+2.2mV 
(mean + s.em., n= 41). When HFF-iN cells were step-depolarized, 
action potentials could be detected in many iN cells at 14-25 days, and 
in all recorded iN cells at days 34-35 (Fig. 3c, Supplementary Figs 7 and 12 
and Supplementary Table 1). Fast-activating and inactivating inward 
Na‘ currents as well as outward K* currents were also observed (Fig. 3d). 
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Figure 3 | Membrane properties of fibroblast iN cells. a, Quantification of 
Tujl-positive neuronal cells from HFFs (line HFF-A) 3 weeks after doxycycline 
treatment or HPFs (line HPF-B) 4 weeks after doxycycline treatment. n = 3 
independent experiments. b, Patch clamp recording was conducted on HFF-iN 
cells identified by EGFP fluorescence and differential interference contrast 
microscopy. c, Representative traces of membrane potentials in response to step 
current injections (lower panel) from an HFF-iN cell 19 days after doxycycline 
treatment. Membrane potential was maintained at approximately —63 mV. 
d, Representative traces of membrane currents recorded with a ramp protocol 
(lower panel). Fast activating and inactivating Na* currents were prominent. 
Three traces are shown superimposed. e-g, HPF-iN cells express Tujl (red) and 
NeuN (green) (e), neurofilament (green, NF) (f) and MAP2 (green) 

(g). h, Representative traces of membrane potentials in response to step current 
injections in HPF-iN cells. Action potentials were generated in cultures without 
glia. i, Representative traces of membrane currents recorded following a ramp 
protocol (lower panel) in HPF-iN cells. The Na™ currents could be blocked by 
tetrodotoxin (TTX). Data in a are presented as mean + s.d. Scale bars, 10 tm 
(a, e-g). 

To determine whether the BAMN factors were also capable of con- 
verting more mature human fibroblasts into iN cells, we derived primary 
human postnatal fibroblasts (HPFs) from three different perinatal fore- 
skin resections. In all three HPF lines, expression of BAMN factors 
reproducibly generated neuron-like cells with co-expression of multiple 
pan-neuronal markers (Fig. 3e-g). Intriguingly, the efficiencies of iN 
cell generation from fetal and postnatal fibroblasts were similar (2-4% 
of cells plated; Fig. 3a). Single-cell gene-expression profiling of iN cells 
revealed that 46 of 51 HPF iN cells co-expressed pan-neuronal and 
synaptic markers 42 days after infection; the majority of the HPF iN 
cells (37 of 51) seem to be glutamatergic neurons. Immunofluorescence 
analysis showed that 6 weeks after infection, 81+ 17% of iN cells 
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Figure 4 | Synaptic responses of HFF-iN cells. a, An HFF-iN cell expressing 
EGFP co-cultured with mouse cortical neurons at day 35 after doxycycline 
treatment. b, Synapsin-positive puncta co-localize with neurites extending 
from HFF-iN cells (arrow heads). c, Thirty-five days after doxycycline 
treatment, spontaneous PSCs were recorded in HFF-iN cells. d, Slow responses 
could be blocked by picrotoxin (PTX). Bursting events of EPSCs were recorded 
in the presence of PTX. The insert shows the fast kinetics of the responses. e, In 
the presence of PTX and CNQX (both 50 [1M), no spontaneous activities were 
observed. f, Evoked postsynaptic responses. Four traces were superimposed. 
g, In the presence of PTX, electric stimulation evoked fast-kinetic excitatory 
PSCs (EPSCs). h, No evoked synaptic responses were observed in the presence 
of PTX and CNQX. Scale bars, 100 tm (a) and 10 pm (b). 


expressed Tbr1 above the levels of fibroblasts, and 15.2 + 6.6% were 
peripherin-positive (Supplementary Fig. 13). Unlike fetal fibroblasts, 
most postnatal fibroblasts showed weak but specific Tbr1 staining 
(Supplementary Fig. 13a—d). Electrophysiological recordings demon- 
strated the presence of regenerative action potentials as well as voltage- 
dependent channel activities in the majority of cells analysed from two 
lines (for example, 17 of 18 cells from line HPF-B, Fig. 3h, i and 
Supplementary Table 1). Furthermore, iN cells with active membrane 
properties could be generated from dermal fibroblasts derived from an 
11-year-old human subject (Supplementary Fig. 14). 

Finally, we determined whether human iN cells can express func- 
tional neurotransmitter receptors and form functional synapses. 
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Application of either GABA (y-aminobutyric acid) or L-glutamate to 
HFF-iN cells induced current responses that could be blocked by 
picrotoxin and CNQX, respectively (Supplementary Fig. 12e, f). We 
then dissociated HFFs 4-7 days after infection with the BAMN factors 
and EGFP and plated them onto previously established mouse cortical 
neuronal cultures. These co-cultures were maintained for up to 
5 weeks thereafter. HFF-iN cells were identified by the EGFP expres- 
sion (Fig. 4a). Whole-cell recordings after 2-3 weeks of co-culture 
showed no synaptic activity (n = 20) but after 4-5 weeks approxi- 
mately half of human iN cells recorded showed spontaneous post- 
synaptic currents (PSCs) with variable kinetics (n = 21, Fig. 4c and 
Supplementary Table 1). Immunostaining with synapsin antibodies 
confirmed the presence of scattered synaptic puncta on the dendrites 
of EGFP-positive cells (Fig. 4b). When the GABA, receptor inhibitor 
picrotoxin was applied, the majority of the spontaneous PSCs were 
blocked, demonstrating that they were inhibitory (IPSCs) (Fig. 4d). In 
the presence of picrotoxin, bursting of spontaneous excitatory post- 
synaptic currents (EPSCs) was shown and could be blocked by the 
AMPA (a-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) 
receptor blocker CNQX (Fig. 4e). Focal stimulation evoked both 
IPSCs and EPSCs that could be blocked by picrotoxin and CNQX 
(Fig. 4f-h). Importantly, PSCs could also be recorded from HPF-iN 
cells co-cultured with mouse cortical neurons 4 weeks after infection 
(Supplementary Fig. 15). These results demonstrate that fetal and 
postnatal fibroblast-derived iN cells could form functional synapses 
and integrated into pre-existing neuronal networks. 

In this report, we have identified a combination of transcription 
factors that are capable of converting human fibroblasts directly into 
neurons. Like mouse iN cells’? and neurons derived from ES cells'?”" 
and iPS cells****, the human iN cells seem relatively immature, as 
indicated by their slightly depolarized membrane potentials and the 
relatively low-amplitude synaptic responses. Compared to mouse iN 
cells, human iN cells required longer culture periods to develop syn- 
aptic activity. Future studies will be necessary to thoroughly optimize 
conditions for human iN cell generation and maturation, which would 
facilitate applications of this method for the study of human neuronal 
development and disease. 


METHODS SUMMARY 

Cell culture. H9 human ES cells (WiCell Research Resources) and iPS cells were 
expanded in mTeSR1 (Stem Cell Technologies) and passaged as clumps or single 
cells**. Primary HFFs were isolated from the distal half of the limbs of GW8-10 
fetuses obtained from Advanced Bioscience Resources Inc. Primary HPFs were 
established from foreskin. Primary mouse cortical cultures and glial monolayer 
cultures were established as described previously’. 

Lentiviral infections. Lentiviral production and fibroblast infections were per- 
formed as described previously’. Primary fibroblasts and pluripotent stem cells 
were infected with concentrated lentivirus and treated with doxycycline (2 1g 
ul!) 16-24h later. 

Electrophysiology and expression analysis. Cells were analysed by immuno- 
fluorescence and electrophysiology as described elsewhere'*”*. Single-cell gene 
expression profiling was performed using the Fluidigm Biomark dynamic 
array'*”® according to the manufacturer’s instructions. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Cell culture. H9 human ES cells (WiCell Research Resources) were expanded in 
mTeSRI (Stem Cell Technologies). Induced pluripotent stem cells were generated as 
described elsewhere”’. The day before infection, cells were treated with Accutase and 
seeded as single cells in 3.5-cm tissue culture dishes on Matrigel in mTeSR1 contain- 
ing 2 1M thiazovivin (Bio Vision)**. To inhibit the growth of uninfected ES cells and 
select for post-mitotic neurons, we added 441M cytosine f-p-arabinofuranoside 
(Ara-C) to the media 48h after doxycycline addition. Primary human fetal fibro- 
blasts were isolated from the distal half of the limbs of 8-10-week-old fetuses 
(Advanced Bioscience Resources Inc.). The tissue was dissociated after trypsin diges- 
tion and plated in MEF media (DMEM high glucose, calf serum, sodium pyruvate, 
non-essential amino acids, penicillin/streptomycin and {-mercaptoethanol). 
Primary human postnatal fibroblasts (HPFs) were established from dissociated fore- 
skin tissue derived from 1-3-day-old newborns. Before being used for experiments, 
primary fibroblast cells were passaged at least three times. Primary mouse cortical 
cultures and glial monolayer cultures were established as described previously’’. To 
maintain the iN cell culture, cells were either grown in N3 medium (DMEM/F2 
(Invitrogen), apotransferrin (100 ug ml~ 1) insulin (5 Lug ml 1) sodium selenite 
(30 nM), progesterone (20 nM), putrescine (100 nM), penicillin/streptomycin) sup- 
plemented with neurotrophic factors including brain-derived neurotrophic factor, 
glial-cell-derived neurotrophic factor, neurotrophin-3 and ciliary neurotrophic fac- 
tor (R&D systems), or dissociated using papain and replated onto previously estab- 
lished monolayer culture of primary mouse glia or primary neurons from mouse 
cortex in neuronal growth medium (MEM (Gibco) supplemented with B27 (Gibco), 
glucose (5 mg ml’), transferring (10 pg ml"), 5% fetal bovine serum and Ara-C 
(2 uM, Sigma)’***. 

Virus infections. Lentiviral production and fibroblast infections were performed 
as described previously’*. Briefly, primary human fetal or postnatal fibroblasts 
were plated and infected with concentrated lentiviral particles and polybrene (8 11g 
ul~') in fresh MEF medium. Viral medium was removed after 16-24h and 
replaced with MEF medium containing doxycycline (2 ug pl). After 24-48 h, 
medium was changed to N3 medium containing doxycycline (2 gy’). For 
human ES cell infections, HD human embryonic stem cells were switched into 
N3 medium containing polybrene (2 pg pl ') 24h after re-plating, and concen- 
trated lentiviral particles were added. After 16-24 h, cultures were switched to N3 
medium containing doxycycline (2 1g pul +) and changed daily before dissociation. 
Forty-eight hours after the initial addition of doxycycline, Ara-C (4 1g pl ') was 
added to the medium to inhibit proliferation of uninfected ES cells until analysis 
6days after the addition of doxycycline. All chemicals were purchased from 
Sigma-Aldrich if not otherwise specified. 

Single-cell gene expression analysis (Fluidigm dynamic array). Single-cell gene 
expression profiling was performed using the Fluidigm Biomark dynamic array 
according to the manufacturer’s protocol’*”*. Briefly, single cells growing on cul- 
ture dishes after 5 or 6 weeks of transduction were collected by aspiration into 
patch electrodes and ejected into 2 X cellsdirect buffer (Invitrogen), flash-frozen 
and kept at —80°C until further processing. Thawed cells were subject to target- 
specific reverse-transcription and 18 cycles of PCR pre-amplification with a mix of 
primers specific to the target genes (STA). STA products were then processed for 
real-time PCR analysis on Biomark 48:48 Dynamic Array integrated fluidic circuits 
(Fluidigm). To ensure the specificity of the amplification, titrations of total haman 
brain RNA were included in each experiment, and only primers that demonstrated 
a linear amplification were analysed. Furthermore, melting curves of the PCR 
products were compared between the single cells and the control RNA to ensure 
the specificity of the PCR products. 


Electrophysiology. Action potentials were recorded with current-clamp whole- 
cell configuration. The pipette solution for current-clamp experiments contained 
(in mM): 123 K-gluconate, 10 KCl, 1 MgCl,, 10 HEPES, 1 EGTA, 0.1 CaCl, 1 
K,ATP, 0.2 Na,GTP and 4 glucose, pH adjusted to 7.2 with KOH. Membrane 
potentials were kept around —65 to —70 mV, and step currents were injected to 
elicit action potentials. For whole-cell voltage-dependent current recordings, the 
same internal solution as aforementioned was used. For synaptic functional evalu- 
ation, the internal solution contained (in mM): 135 CsCl, 10 HEPES, 1 EGTA, 4 Mg- 
ATP, 0.4 Na,GTP, and 10 QX-314, pH 7.4. The bath solution contained (in mM): 
140 NaCl, 5 KCl, 2 CaCl, 2 MgCl, 10 HEPES, and 10 glucose, pH 7.4. Synaptic 
responses were measured as described previously’*”*. Stimulus artefacts for evoked 
synaptic responses were removed for graphic representation. Electrophysiological 
data are presented as mean + s.e.m. All statistical comparisons were made using 
Student’s t-test. 

Immunofluorescence and RT-PCR. For immunofluorescence experiments, cells 
were fixed in 4% paraformaldehyde in PBS for 10 min at room temperature. After 
fixation, cells were incubated in 0.2% Triton X-100 in PBS for 5 min at room tem- 
perature. After washing twice with PBS, cells were blocked in a solution of PBS 
containing 4% BSA and 1% Cosmic calf serum (CCS) for 30 min at room temper- 
ature. Primary and secondary antibodies were applied for 1h and 30 min, respect- 
ively. The following antibodies were used for our analysis: rabbit anti-Tujl (Covance, 
1:1,000), mouse anti-Tujl (Covance, 1:1,000), mouse anti-MAP2 (Sigma, 1:500), 
mouse anti-NeuN (Millipore, 1:200), mouse anti-neurofilament (Developmental 
Studies Hybridoma Bank (DSHB), 2H3a, 1:1,000), rabbit anti-synapsin (E028, 
1:1,000), anti-synaptotagmin (Synaptic systems, 41.1, 1:2,000), guinea pig anti- 
vGLUT1 (Millipore, 1:2,000), mouse anti-GAD6 (DSHB, 1:500), rabbit anti-Tbr1 
(Abcam, 1:200), mouse anti-peripherin (Sigma, 1:100), sheep anti-tyrosine hydro- 
xylase (Pel-Freez, 1:500), rabbit anti-GFAP (DAKO, 1:4,000), mouse anti-Sox2 (R&D 
Systems, 1:50), goat anti-Brn2 (clone C-20, Santa Cruz Biotechnology, 1:100), rabbit 
anti-Ascll (Abcam, 1:200), mouse anti-BrdU (Becton Dickinson, 1:50), mouse 
anti-LU5 (Abcam, 1:200), goat anti-Soxl0 (Santa Cruz Biotechnology, 1:40). 
Fluorescein isothiocyanate (FITC)-, and Cy3-conjugated secondary antibodies were 
obtained from Jackson Immunoresearch. Alexa-488-, Alexa-546- and Alexa-633- 
conjugated secondary antibodies were obtained from Invitrogen. 4’,6-Diamidino- 
2-phenylindole (DAPI) was from Sigma (1:10,000). For RT-PCR analysis, RNA 
was isolated using an RNAqueous Kit (Applied Biosystems) following the manu- 
facturer’s instructions, treated with DNase (Applied Biosystems) and reverse- 
transcribed with Superscript III (Invitrogen). 

Efficiency calculation. The following method was used to calculate the efficiency 
of neuronal induction. The total number of Tujl-positive cells with a neuronal 
morphology, defined as cells having a circular, three-dimensional appearance that 
extend a thin process at least three times longer than their cell body, were quan- 
tified at indicated time points. We determined this number in at least 15 randomly 
selected X20 visual fields on a Zeiss Axio Observer microscope. The average and 
standard deviation per field was determined and then used to extrapolate the total 
number of iN cells present in the entire dish based the known surface areas of a 
X20 visual field and the respective culture dish. We then divided this number by 
the number of cells plated before infection to get the percentage of the starting 
population of cells that adopted neuron-like characteristics. In multiple independ- 
ent experiments we verified that this extrapolation method yields cell numbers 
very similar to those measured by a haemocytometer at the time of plating. Data 
are presented as mean = s.d. 


27. Somers, A. et al. Generation of transgene-free lung disease-specific human 
induced pluripotent stem cells using a single excisable lentiviral stem cell cassette. 
Stem Cells 28, 1728-1740 (2010). 


©2011 Macmillan Publishers Limited. All rights reserved 


1 sid il Be 


doi:10.1038/nature10284 


Direct generation of functional dopaminergic 
neurons from mouse and human fibroblasts 


Massimiliano Caiazzo', Maria Teresa Dell’Anno!*, Elena Dvoretskova7*, Dejan Lazarevic?*, Stefano Taverna’, Damiana Leo”, 
Tatyana D. Sotnikova’, Andrea Menegon’, Paola Roncaglia‘*, Giorgia Colciago', Giovanni Russo”, Piero Carninci®, Gianni Pezzoli’, 
Raul R. Gainetdinov’, Stefano Gustincich*®, Alexander Dityatev? & Vania Broccoli! 


Transplantation of dopaminergic neurons can potentially improve 
the clinical outcome of Parkinson’s disease, a neurological disorder 
resulting from degeneration of mesencephalic dopaminergic neu- 
rons’*. In particular, transplantation of embryonic-stem-cell- 
derived dopaminergic neurons has been shown to be efficient in 
restoring motor symptoms in conditions of dopamine defi- 
ciency**. However, the use of pluripotent-derived cells might lead 
to the development of tumours if not properly controlled*. Here we 
identified a minimal set of three transcription factors—Mash1 
(also known as Ascl1), Nurrl (also known as Nr4a2) and 
Imx1a—that are able to generate directly functional dopaminergic 
neurons from mouse and human fibroblasts without reverting to a 
progenitor cell stage. Induced dopaminergic (iDA) cells release 
dopamine and show spontaneous electrical activity organized in 
regular spikes consistent with the pacemaker activity featured by 
brain dopaminergic neurons. The three factors were able to elicit 
dopaminergic neuronal conversion in prenatal and adult fibro- 
blasts from healthy donors and Parkinson’s disease patients. 
Direct generation of iDA cells from somatic cells might have sig- 
nificant implications for understanding critical processes for neur- 
onal development, in vitro disease modelling and cell replacement 
therapies. 

Seminal studies have demonstrated that functional neurons can be 
generated independently of stem cells by direct cell conversion through 
genetics-based approaches®. More recently, in a set of elegant experi- 
ments, fibroblasts have been directly converted into neuronal cells 
(iNs) by the forced expression of the three neurodevelopmental factors 
Mash1, Brn2 (also known as Pou3f2) and Myt1 ’. However, iNs rep- 
resent a heterogeneous population of glutamatergic and GABAergic 
neurons and their degree of global reprogramming remains to be 
properly characterized. It is thus unclear whether a specific neuronal 
subtype can be preferentially induced from direct reprogramming of 
heterologous cells. Therefore, we aimed to generate dopaminergic neu- 
rons through the direct conversion of somatic cells by forced expres- 
sion of lineage-specific factors that act during brain development*”. 
Initially, we transduced mouse embryonic fibroblasts (MEFs) from 
TH-GFP transgenic mice’® with a mixture of doxycycline (dox)- 
inducible lentiviruses expressing all selected factors (11 dopaminergic 
and 3iN; Supplementary Table 1) or with DsRed retrovirus (negative 
control) (Fig. la-d). We did not observe any GEP* cells in MEEs 10 
days after Ds-Red retrovirus infection or in culture without any viral 
infection (Fig. 1d, inset). In contrast, transduction of all factors resulted 
in the generation of a small number of bright GFP cells (1.8 + 0.8%) 
(Supplementary Fig. 1d-f). We next sought to determine the minimal 
set of genes required for dopaminergic neuronal induction. Given its 
essential role as a proneural gene during neurogenesis, Mash1 was 


introduced into MEFs together with each other single dopaminergic 
factor. Reporter gene expression was elicited only when Mash1 was 
combined with Nurr1, a critical determinant of dopaminergic neuronal 
specification and survival during development and in adulthood". 
However, Mash1/Nurr1 combined activation elicited a modest increase 
of GEP* cells (8 + 2%) (Supplementary Fig. 1g-i). Therefore, we added 
a third molecule of the 12 remaining and scored for the rate and mor- 
phology of GFP* cells in each combination. Surprisingly, only Lmxla 
and in part Lmx1b (18 + 3% versus 13 + 3% of GFP™ cells, respec- 
tively) were able to synergize with Mash1/Nurr1, robustly increasing 
the generation of GFP * cells with an evidently complex neuronal mor- 
phology (Fig. 1h and Supplementary Fig. 1s-y). Using the Mash1/ 
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Figure 1 | Mash1, Nurrl and Lmx1a reprogram mouse fibroblasts into iDA 
cells. a, b, TH and GFP detection in TH-GFP adult brain (a) and ventral 
midbrain primary cell culture (b). DIV, days in vitro. SN, substantia nigra; 
VTA, ventral tegmental area. P112, postnatal day 112. c, Scheme of 
dopaminergic transcription factors (TFs) screening. d, TH staining in iDA cells 
and in uninfected MEFs (inset) after 16 days in vitro. e-g, i-l, iDA cells are 
positive for the dopaminergic markers TH (e-g), VMAT2, ALDH1A1, 
calbindin and DAT (i-l). h, Quantification of TuJ1* and TH" cells. AN, 
Mash1, Nurr1; ANLa, Mash1, Nurr1, Lmx1a; NI, non-induced. Data are 
presented as mean = s.e.m. Scale bars: 500 um (a), 20 ttm (b, j), 50 pm (e- 

g, k, 1) and 100 um (d, i). 
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Nurr1/Lmx1a factor combination, double GFP*/TH* cells represented 
the majority of the induced TuJ1 neuronal cells (85 + 4%). Supplemen- 
tation ofa fourth factor among the remaining ones failed to produce any 
further increase in GEP* cells, with Brn2 and Mytil, the other two iN 
factors, even reducing the overall reprogramming efficiency (data not 
shown). For these reasons we focused on cells reprogrammed exclu- 
sively with the Mash1/Nurrl/Lmxla factor combination. The same 
gene cocktail was also proficient in reprogramming adult mouse fibro- 
blasts with high efficiency (Supplementary Fig. 2). 

Sixteen days after reprogramming, a large number of GFP cells 
expressed many of the distinctive components of the dopaminergic 
machinery, such as TH, vesicular monoamine transporter 2 (VMAT2; 
also known as SLC18A2), dopamine transporter (DAT; also known 
as SLC6A3), as well as aldehyde dehydrogenase lal (ALDH1A1) and 
calbindin (Fig. 1e-l). Conversely, markers associated with adrener- 
gic (dopamine-B-hydroxylase (DBH)) or serotonergic (tryptophan 
hydroxylase 1 or 2 (TPH1/2)) serotonin transporter (SERT; also 
known as SLC6A4) neurons were not induced (data not shown). 
Transcriptional analysis by reverse transcription—-polymerase chain 
reaction (RT-PCR) confirmed the activation of the dopamine-specific 
gene network including the endogenous expression of Nurrl and 
Lmx1a (Supplementary Fig. 3). Global expression analysis showed that 
iDA cells clustered with A9 and A10 adult mesencephalic dopaminergic 
(mDA) neurons rather than with their fibroblasts of origin as illustrated 
by hierarchical clustering (Fig. 2a, b) and the general degree of gene 
expression overlap (Fig. 2c). Of note, many representative genes of the 
dopaminergic phenotype, such as Th, Vmat2, Aadc (also known as Ddc), 
Ret, Gfral, Foxal, Gdnf and Drd2 were highly enriched (Fig. 2d). 
Conversely, genes coding for adrenergic and serotonergic biosynthetic 
enzymes were found to be not upregulated in the reprogrammed cells 
(Fig. 2e). Moreover, the fibroblast markers Twist2, Zeb2, Tgfblil and 
Chd2 (ref. 12) were downregulated in iDA cells (Fig. 2f). These findings 
indicate that the genetic reprogramming erased the majority of the 
evident expression hallmarks of the cell of origin, while specifically 
inducing the dopaminergic neuronal phenotype and not those of other 
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Figure 2 | Mouse iDA cells expression profiling. a, Heat-map of genes 
differentially expressed in RNA-microarray analysis performed on MEFs (NI), 
iDA cells and brain mDAs (A9-A10). b, c, Hierarchical clustering (b) and 
general degree of overlapping expression (c) among the three cell populations 
analysed. d-f, Scatter plots show that in iDA cells the majority of dopaminergic 
(DA) markers are increased, whereas other monoaminergic neuronal markers 
are not activated and fibroblasts markers are silenced. 
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closely related neuronal subtypes. It should be noted that iDA expression 
profiling was close but distinguishable from that of mDA neurons, with 
160 genes differently expressed with a =5-fold change (Supplementary 
Table 3). Whether this might indicate the presence of residual fibroblast 
gene expression in iDA cells remains to be addressed. 

Interestingly, Th and Vmat2 promoter regions were highly demethy- 
lated in dopaminergic neuronal cells whereas they were fully methy- 
lated in parental fibroblasts, indicating their epigenetic reactivation 
during dopaminergic neuronal conversion (Supplementary Fig. 4). 

GFP * cells induced by the three factors showed an elaborate neuronal 
morphology with multiple and long processes (Fig. 1d-l). Hence, we 
asked whether induced neuronal cells establish synaptic contacts in 
culture. Notably, synaptic resident proteins such as synaptotagmin I 
(SYT1) and synapsin (SYN) were localized in discrete puncta and colo- 
calized with TH immunolabelling, suggesting the establishment of 
dopaminergic synaptic terminals (Supplementary Fig. 5). Moreover, 
successful FM4-64 dye uptake at TH™ synaptic boutons indicated active 
synaptic processes (Supplementary Fig. 5). 

Next, we performed patch-clamp recordings of GFP" iDA cells 
(n = 16) as well as primary mDA neurons (n = 12) to compare their 
respective physiological properties'*"*. iDA cells had higher cell res- 
istance and lower capacitance than primary dopaminergic neurons, 
but showed normal resting membrane potential, normal Na‘ currents 
(Fig. 3a), overshooting action potentials (Fig. 3c), and even more 
prominent K~ currents (Fig. 3b) and afterspike hyperpolarizations 
(Supplementary Table 4). More than 80% of iDA cells showed rhyth- 
mic discharges (Fig. 3d, e) at an average frequency of 2.6 Hz. The 
identity of voltage-gated inward Na* and outward K™ currents in 
iDA cells has been verified pharmacologically (Supplementary Fig. 6). 
Next, we noted that iDA cells, like mDA neurons, express high levels of 
the D2 receptor (Fig. 3f). To verify whether the dopamine receptors 
were functional, we applied the specific D2/D3 receptor agonist quin- 
pirole (1 1M), which markedly suppressed neuronal firing in 6 of 10 
recorded cells ina reversible manner (Fig. 3g, h). Next, we used ampero- 
metry for real-time electrochemical detection of monoamine secretion 
from iDA cells'*’®. When carbon fibre electrodes were placed adjacent 
to GEP* cells (Fig. 3i), depolarization of 4 cells with 25 mM K* resulted 
in numerous amperometric events (Fig. 3i), reflecting quantal secretion 
of monoamines. Furthermore, high-performance liquid chromato- 
graphy (HPLC) measurements revealed that iDA cells contain high 
levels of intracellular dopamine, detectable in pellet preparations, which 
can be released in the extracellular medium on stimulation with 50 mM 
KCl (Fig. 3j). Thus, reprogrammed cells show several major properties 
of dopaminergic neurons in terms of spontaneous spiking activity, 
temporal parameters of action potentials, inhibition of cell firing 
through D2 autoreceptors and controlled dopamine release. 

Next, to determine the temporal requirements for the three exogenous 
factors to induce a stable reprogrammed cell state, infected MEFs were 
treated with dox for different time windows, after which dox was with- 
drawn. Only when fibroblasts were treated with dox for 6 or more days 
were numerous neuronal cells, mostly TH", observed (Supplementary 
Fig. 7). Thus, reprogramming is a relatively rapid process that requires 
the expression of the three factors for only 6 days. At the same time, iDA 
cells achieved a stable neuronal state over time independently from viral 
transgene expression and even at 18 days of dox withdrawal, iDA cells 
were found at the same number, and exhibited similar spontaneous firing 
as iDA cells constantly cultivated in the presence of dox (Supplementary 
Fig. 8 and Supplementary Table 4). 

Reprogramming of fibroblasts into differentiated neuronal cells 
might occur directly or by passing first through neural progenitors. 
When the DNA-base analogue BrdU was added from day 2 onwards to 
label the proliferating cells, virtually all neuronal cells were already 
post-mitotic after this time (Supplementary Fig. 9b, c, h). Despite 
the fact that during the first 2 days infected cells were actively prolif- 
erating in serum-containing medium, none showed expression of the 
neural progenitor molecular markers Sox2, Ngn2, Otx2, Lmx1b and 
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Figure 3 | Functional characterization of mouse iDA cells. a, b, Whole-cell 
voltage-clamp recording of Na* and K™ currents. c, Current-clamp recording 
of multiple action potentials evoked by current injection. d, e, Current-clamp 
recording and interspike interval frequency of spontaneous action potentials. 
f, D2 receptor (D2R) staining. g, h, Effect of the D2/D3 agonist quinpirole on 
spiking frequency (g) and its statistical analysis (h) (*P = 0.005, paired t-test, 
n= 6). i, Amperometric recordings of monoamine release after K* 
stimulation; release events are shown with high-resolution below the image of 
the recorded cell. j, Dopamine content measured by HPLC in uninfected (NI) 
and iDA cells, both in cell pellets and in the supernatant (SN) after KT 
stimulation. All cells are 16 days in vitro unless otherwise stated. Scale bars: 
50 um (b) and 20 um (i). Data are presented as mean + s.e.m. 


En1 (ref. 17) (Supplementary Fig. 9i). Furthermore, we used a genetic 
tracing system based on activation of the Sox2’®° LacZ reporter!’ 
showing that, as a proof-of-principle, LacZ activity was easily visua- 
lized upon reprogramming of Sox2*/8°° fibroblasts into induced 
pluripotent stem cells. By contrast, the reporter was never activated 
from the same cells when engaged into direct iDA reprogramming 
(Supplementary Fig. 9j). Altogether, these findings are inconsistent 
with the occurrence of detectable cell intermediates during the repro- 
gramming of fibroblasts into iDA cells. 

Next, the in vivo differentiation potential of iDA cells was assessed 
by orthotopic transplantations into neonatal mouse brains. Four days 
after viral transgene induction, infected cells were grafted into the 
ventricle of mouse newborn brains. Two and six weeks after transplan- 
tations, GFP” cells were found integrated in the host tissue showing an 
extremely elaborated morphology (Supplementary Figs 10 and 11). 
Most of the GFP* grafted neuronal cells were positive for TH, 
AADC, VMAT2 and DAT, indicating the acquirement of a full neur- 
onal dopaminergic cell fate (Supplementary Fig. 11b-g, i-l). Injection 
of brief supra-threshold current pulses evoked overshooting action 
potentials, and large Na* and K* currents were activated by depol- 
arizing voltage steps (Supplementary Fig. 11o, p). Therefore, iDA cells 
maintain excitability and major currents in vivo even after an extensive 
period of time from grafting. 

We then translated the same procedure to the human system by 
initially infecting IMR90 fetal fibroblasts. After 18 days from infection, 
we scored numerous Tui’ and TH™ neuronal cells accounting, 
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respectively, for 10+4% and 6+2% of the infected cells (Sup- 
plementary Fig. 12a-c). We then reprogrammed adult human fibro- 
blasts from two healthy donors (aged 42 and 55) and from two patients 
with genetic forms of Parkinson’s disease (Supplementary Table 5). 
Both healthy and diseased adult cells showed a comparable propensity 
to convert into neuronal cells, accounting for an estimated efficiency 
for TuJl* and TH™ cells of 5 + 1% and 3 + 1%, respectively (Fig. 4a— 
g). iDA cells were positive for ALDH1A1, TH, AADC, VMAT2 and 
DAT by immunocytochemistry (Fig. 4a-f) and gene expression ana- 
lysis (Supplementary Fig. 13). Cell conversion was stable over time as 
the number and morphology of human iDA cells was not obviously 
affected up to day 24 after reprogramming even when dox was with- 
drawn from day 6 onwards (Fig. 4h, i). 

Recordings in five infected fetal human iDA cells showed that the 
electrophysiological properties of these cells resemble mouse iDA cells 
(Supplementary Fig. 12). Recordings in eight infected adult human iDA 
cells revealed less mature phenotypes (Fig. 4j-1), with mean amplitudes: 
0.7+0.1nA for Na* currents; 1.2+0.1nA for delayed rectifier Kt 
currents; 78 + 3 mV for action potentials; and 7.5 + 2 mV for afterspike 
hyperpolarization. The identity of Na* and K™ currents was confirmed 
pharmacologically (Fig. 4m, n). Most importantly, depolarization of 
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Figure 4 | Characterization of human fibroblasts reprogrammed into iDA 
cells. a—f, Fibroblasts from a healthy donor (a-d) and a Parkinson’s disease (PD) 
patient (e, f) show a comparable efficiency in dopaminergic neuronal conversion. 
g, Quantification of iDA cells obtained from fetal (IMR90), healthy and 

Parkinson’s disease adult fibroblasts. h, Quantification of TuJl* and TH“ cells in 
a time-course study from 0 to 24 days in vitro (DIV). i, Quantification of TuJ1 « 
and TH reprogrammed cells kept with (w) or without (w/o) doxycycline for 6, 
12, 18 or 24 DIV. Data are presented as mean + s.e.m. j, k, Whole-cell voltage- 
clamp recording of Na‘ and K™ currents. 1, Whole-cell current-clamp recording 
of single action potential elicited by a minimal depolarization. m, n, Suppression 
ofNa™ (m) and K* (n) currents and action potentials by tetrodotoxin (TTX) and 
4-AP. 0, Amperometric recordings of release events after K* stimulation; high- 
resolution pattern is shown below the image of the recorded cell. All cells are 18 
days in vitro unless otherwise stated. Scale bars: 50 Jum (a-c) and 20 um (d-f, 0). 
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three cells with 25 mM K™ elicited numerous release events detected by 
amperometric measurements, as described earlier for mouse iDA cells 
(Fig. 40). In summary, these experiments indicate that actively spiking 
dopamine-secreting cells can be induced by forced expression of the 
three factors in adult human cells from both healthy donors and 
Parkinson’s disease patients. 

Here, we demonstrated that the combination of three transcription 
factors— Mash1, Nurrl and Lmx1la—can rapidly and efficiently induce 
dopaminergic neuronal cells from mouse and human fibroblasts. 
Reprogrammed cells are similar to brain dopaminergic neurons in gene 
expression, and show dopamine release and pacemaker activity that can 
be modulated via D2 receptors. Importantly, this cell conversion 
diverges with respect to developmental neuronal lineage commitment 
as it does not progress through detectable intermediate neuronal stages. 
How this might impact on the overall functional aspects of the cells will 
need to be tested further in long-term in vivo transplantation studies. 

Generation of functional dopaminergic neuronal cells by direct 
reprogramming opens new possibilities for regenerative therapies for 
Parkinson’s disease and related disorders. However, to be clinically 
relevant, the overall cell conversion process needs to be highly efficient 
and the addition of supplementary factors might be helpful’. 
Interestingly, a different gene cocktail capable of inducing dopaminer- 
gic-like neurons from human fibroblasts has been identified”®. This 
opens the intriguing possibility that different molecular fate determi- 
nants reach a similar endpoint even though acting on different tran- 
scriptional cascades. 

These procedures do not rely on pluripotent stem cells that are prone 
to tumours in their undifferentiated state. Moreover, the process 
described here does not pass through proliferative progenitors that also 
might be tumorigenic’’. Thus, these procedures might avoid a dangerous 
drawback of stem cell therapies while providing a sufficient number of 
functional dopaminergic neurons amenable for autologous cell replace- 
ment therapies. 


METHODS SUMMARY 

Cell culture and viral infection. MEFs were isolated from embryonic day (E)14.5 
wild-type or TH-GFP mouse embryos. Adult human fibroblasts isolated from 
healthy subjects and Parkinson’s disease patients as well as from human fetal lung 
fibroblasts (IMR90) were grown in MEF media. Cells were infected with dox- 
inducible lentiviruses as previously reported’. 

Electrophysiology and amperometry. Electrophysiological recordings were per- 
formed in on-cell and whole-cell configurations. Carbon-fibre microelectrodes 
were used for amperometric recordings’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture. MEFs were isolated from E14.5 wild-type or TH-GFP knock-in mice 
embryos. Head, vertebral column, dorsal root ganglia and all internal organs were 
removed and discarded and the remaining embryonic tissue was manually dis- 
sociated and incubated in 0.25% trypsin (Sigma) for 10-15 min. Cells from each 
embryo were plated onto a 15-cm tissue culture dish in MEF media (Dulbecco’s 
modified Eagle medium (DMEM; Invitrogen) containing 10% fetal bovine serum 
(FBS; Hyclone), non-essential amino acids (Invitrogen), sodium pyruvate and 
penicillin/streptomycin (Invitrogen)). In all experiments cells were not split more 
than four times. Mouse adult fibroblasts were isolated from tail tip samples. Tails 
were peeled, minced into 1 cm pieces, placed on culture dishes, and incubated in 
MEF media for 5 days. Adult human fibroblasts were isolated from skin biopsy 
samples of healthy and Parkinson’s disease patients’ provided from the “Cell 
Line and DNA Biobank from Patients affected by Genetic Diseases” (G. Gaslini 
Institute) and “Parkinson Institute Biobank” (Milan, http://www.parkinson. 
it/dnabank.html) of the Telethon Genetic Biobank Network (http://www. 
biobanknetwork.org). The informed consent as issued by the ICP Ethical com- 
mittee was obtained by healthy and Parkinson’s disease patients enrolled for the 
DNA and cell biobank collection. 

Human skin samples were mechanically dissociated and plated on matrigel- 
coated dishes. Human fibroblasts were cultured as MEFs. Mouse and adult fibro- 
blasts were grown in MEF media as well as human fetal lung fibroblasts IMR90 
(ATCC). Mesencephalic dopaminergic primary cell cultures from TH-GFP mice 
were prepared as previously described**. Mice were maintained at San Raffaele 
Scientific Institute Institutional mouse facility and experiments were performed in 
accordance with experimental protocols approved by local Institutional Animal 
Care and Use Committees. 

Molecular cloning and viral infection. Complementary DNAs for the dopami- 
nergic transcription factors were cloned into lentiviral vectors under the control of 
the tetracycline operator’. Replication-incompetent, VSVg-coated lentiviral particles 
were packaged in 293T cells. MEFs, IMR90 and adult mouse and human fibroblasts 
were infected in MEF media. 16-20 h after infection cells were switched into fresh 
MEF media containing doxycycline (2mg ml‘; Sigma). After 48h medium was 
replaced with neuronal inducing media (DMEM/F12 (Invitrogen), 25 jg ml insu- 
lin (Sigma), 50 pg ml * transferrin (Sigma), 30 nM sodium selenite, 20 nM proges- 
terone (Sigma), 100nM putrescine (Sigma) and penicillin/streptomycin (Sigma)) 
containing doxycycline. The medium was changed every 2-3 days for a further 
10-22 days. For proliferation assay, MEFs were treated with a 48h pulse of 10 1M 
BrdU. 

Immunohistochemistry. For immunocytochemical analysis, 5 X 10* mouse or 
human fibroblasts were plated on matrigel-coated glass coverslips the day before 
the infection. 10-28 days after viral infection cells were fixed for 20 min at room 
temperature (20-25 °C) in 4% paraformaldehyde in PBS, permeabilized for 30 min 
in PBS containing 0.1% Triton X-100 and 10% normal goat serum (NGS), and 
incubated overnight at 4 °C in PBS containing 10% NGS and primary antibodies. 
Then cells were washed three times with PBS and incubated for 2h at room 
temperature with anti-rabbit or anti-mouse secondary antibodies Alexa Fluor- 
488 or Alexa Fluor-594 (1:500, Invitrogen). For immunohistochemical analysis 
P15 or adult mouse brains were fixed overnight at 4 °C with 4% paraformaldehyde, 
buffered in 30% sucrose and embedded in OCT. Frozen brains were sectioned into 
15- or 40-um-thick sections with a cryostat and processed for immunostaining. 
Sections were boiled for 3 min in 10 mM citrate buffer solution pH 6 for antigen 
retrieval and permeabilized for 1 h at room temperature in PBS containing 0.1% or 
0.25% Triton X-100 and 10% NGS. Primary antibodies were as follows: mouse 
anti-TH (1:200, Millipore), rabbit anti-TH (1:200, Immunological Sciences), 
mouse anti-BllI-tubulin (1:500, Covance), rabbit anti-Blll-tubulin (TuJ1) 
(1:500, Covance), rabbit anti-VMAT2 (1:200, Chemicon), rat anti-DAT (1:500, 
Millipore), rabbit anti-D2 receptor (1:100, Millipore), rabbit anti-calbindin (1:200, 
Swant), rabbit anti-AADC (1:100, Novus Biologicals), rabbit anti-ALDH1A1 
(1:200, Abcam), mouse anti-synaptotagmin I (1:200, Synaptic Systems), mouse 
anti-synapsin (1:200; Synaptic Systems), chicken anti-GFP (1:2,000, Molecular 
Probes), rat anti-BrdU (1:200, BD), mouse anti-MAP2 (1:500, Immunological 
Sciences), rabbit anti-Otx2 (1:100 R&D). B-Galactosidase staining was performed 
as previously described”. 

Statistical analysis. The total numbers of Th* and Tujl~ cells were quantified 
12-24 days after infection. Cell counting was performed on ten fields from three 
replicates for each condition and normalized with the number of cells plated 
before the infection. Data were expressed as mean ~ s.e.m. 

RT-PCR. RNA was extracted from single cultures, using Trizol isolation system 
(Invitrogen) according to manufacturer’s instructions. The yield and integrity of 
the RNA were determined by the spectrophotometric measurement of A260 and 
by agarose-gel electrophoresis, respectively. Total RNA was treated with DNase I 
(Qiagen) to prevent DNA contamination. Two micrograms of RNA were reverse 


transcribed using the Transcriptor High Fidelity cDNA Synthesis Kit (Roche). 
One twentieth of the reverse transcribed cDNA was amplified in a 25 microlitres of 
reaction mixture containing Taq polymerase buffer (Fisher BioReagents), 0.2 mM 
dNTPs (Finnzymes OY), 0.4 micromolar of each primer, 1 U Taq polymerase 
(Fisher BioReagents). The primers used to amplify cDNA samples are listed in 
Supplementary Table 6. 

Cell sorting, laser capture dissection and microarray analysis. TH-GFP- 
positive iDA cells were directly sorted in Trizol (Invitrogen) using the cell sorter 
FACSVantage s.e. DiVa (Becton Dickinson). Thus RNA was extracted as reported 
earlier and biotin-labelled CRNA was obtained using the Ovation kit (NuGEN). 
Labelled cRNA was hybridized (CBM genexpression facility, SISSA) on 
Affymetrix Mouse Gene 1.0 ST Arrays, containing 35,557 probe sets correspond- 
ing to 28,853 genes. Hybridized arrays were stained and washed (GeneChip 
Fluidics Station 450) and scanned (GeneChip Scanner 3000 7G). Cell intensity 
values were computed using the Affymetrix GeneChip Operating Software 
(GCOS). Further data processing was performed in the R computing environ- 
ment (http://www.r-project.org/) version 2.8.0 with BioConductor packages 
(http://www. bioconductor.org/). Robust Multi- Array Average (RMA) normaliza- 
tion was applied’*. Data were then filtered based on probe set intensity, so that only 
probe sets that had intensity values of >50 in at least half the arrays were retained. 
Statistical analysis was performed with limma”’. P values were adjusted for mul- 
tiple testing using Benjamini and Hochberg’s method to control the false discovery 
rate**. Genes with adjusted P values below 0.01 were considered to be differentially 
expressed. Furthermore, a fold-change threshold cutoff was set to focus on genes 
whose expression level changed at least 2 times. Data were analysed through 
DAVID Bioinformatics Resources v6.7 (refs 28, 29). 

Gene expression profiles of adult A9 and A10 dopaminergic neurons were 
obtained as previously described””. In brief, adult TH-GFP female mice were killed 
by cervical dislocation. The brains were rapidly cut to isolate the midbrain region, 
and immediately immerged in 1X zinc fixative (BD Pharmingen) for 4-6h at 
+4 °C. Once fixed, tissues were moved to 30% sucrose in 1 X zinc fixative solution 
at +4 °C overnight. Following inclusion in OCT, tissues were frozen in iso-pentan 
(Sigma) and percooled with liquid nitrogen. 14-mm cryosections were mounted 
on SuperFrost plus glass slides (Menzel-Glaser) and air dried. mDA A9 and A10 
neurons (each one from three different mice) were isolated from cryosections by 
using a PALM LCM microdissection system (PALM Microlaser Technology). To 
facilitate detection of fluorescent neurons, a drop of 1 X zinc fixative was applied to 
the section during cell selection. The sections were air dried, neurons were dis- 
sected and catapulted onto PALM adhesive caps (Zeiss). Total RNA from 2,500 
pooled neurons was isolated by using the Nano RNA extraction kit (Stratagene) 
and contaminating genomic DNA was removed through on-column DNase diges- 
tion step. The common reference RNA was generated from three midbrain regions 
of age-matched female mice. Midbrain RNA was isolated using RNeasy Mini kit 
(Qiagen), followed by DNase treatment. RNA from dissected neurons and all 
midbrains was amplified and labelled by Ovation Pico kit, WT exon and Encore 
biotin labelling kit (Nugene), following manufacturer’s instructions. Once pre- 
pared, each target was hybridizated on MoExon 1.0ST GeneChip (Affymetrix). 
Statistical analysis was performed by oneChannelGUI R package. All hierarchical 
clusters were generated by TMEV software. 

Bisulphite genomic sequencing. DNA from sorted TH-GFP* reprogrammed 
MEFs was modified using the CpGenome modification kit (Chemicon) according 
to the manufacturer's recommendations. Thus Th and Vmat2 promoters CpG- 
rich selected regions were amplified using the PCR primers listed in Supplemen- 
tary Table 5. 

HPLC. To quantify dopamine level in reprogrammed cells, cell pellets were homo- 
genized in 100 pl 0.1 N HCIO, and analysed by using HPLC with electrochemical 
detection (Alexis 100, Antec Leyden). To measure dopamine concentrations in the 
supernatants, cells were exposed to media with or without 50 mM KC] for 30 min, 
then 0.9 ml of supernatants were collected with the addition of 0.1 ml of 1N 
HCIO,, filtered and analysed by HPLC. Dopamine was separated on a reverse- 
phase column (ALB-105, 3 um, 50 X 1mm) with a mobile phase consisting of 
50 mM phosphate buffer, 8mM KCl, 500 mg! ' octyl sodium sulphate, 0.1 mM 
EDTA, and 3% methanol (pH 6.0) at a flow rate of 50 pl min” '. Dopamine was 
detected by a Decade II electrochemical detector equipped with micro VT-03 
electrochemical flow cell and a 0.7-mm-diameter glassy carbon electrode (Alexis 
100, Antec Leyden). The volume of the injection was 5 pl. The detection limit 
established as a 3:1 signal-to-noise ratio was below 0.5 nM. 

Electrophysiology. Recordings were performed from reprogrammed mouse and 
human fibroblasts and primary mDA neurons. The mouse TH-GFP* cells 
selected for the electrophysiological analysis were not so flat as the fibrobalsts, 
and had several well-developed neurites. The human cells selected for the electro- 
physiological analysis also had neuron-like shapes with clearly distinguishable 
neurites by phase contrast microscopy. Only cells without signs of detachment 
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from the substrate were used for recordings. Cells were perfused continuously with 
HEPES-buffered saline (HBS) of the following composition (in mM): 140 NaCl, 5 
KCl, 2 CaCh, 2 MgCls, 15 HEPES, and 25 glucose, pH 7.4. The patch pipette 
solution contained (in mM): 130 K-gluconate, 10 KCI, 0.5 CaCl, 15 HEPES, 5 
EGTA, 8 NaCl, 2 MgATP, 0.3 NazGTP, and 10 glucose, pH adjusted to 7.2 with 
KOH. Action potentials were recorded in the on-cell and the current-clamp 
whole-cell configurations. A current was injected to have membrane potentials 
around —60 mV, and step currents from —50 pA to 40 pA were injected to elicit 
action potentials. Na‘ currents and composite K* currents were recorded in the 
voltage-clamp configuration by delivering voltage steps ranging from — 100 mV to 
+20 mV in cells held at —60 mV. Delayed rectifier K* currents were activated by 
0.5 s voltage steps from —40 mV to +20 mV after a 0.5-s-long step to —40 mV. 
A-type K* currents were isolated by subtraction of delayed rectifier K* currents 
from those activated by voltage steps after a 0.5-s-long step to —100mV. 
Recordings were performed using an EPC10 USB patch clamp amplifier and 
PATCHMASTER software (HEKA Elektronik). Data were digitized at 10 kHz 
and analysed with FITMASTER Software (HEKA Elektronik). Detection and 
measurements of action potentials were performed using MiniAnalysis software 
(Synaptosoft). 

In vivo electrophysiology. Slices were obtained from transplanted mice at postnatal 
day 42. The brains were quickly removed from the skull in ice-cold artificial cere- 
brospinal fluid (ACSF) containing the following (in mM): 125 NaCl, 25 NaHCO3, 
2.5 KCL, 1.25 NaH2PO,, 2 CaCh, 1 MgCl, and 25 glucose, pH 7.4 (bubbled with 
95% Oz and 5% CO ). Coronal slices (300-j1m thick) were cut using a vibratome 
(VT1000S; Leica) and stored in ACSF at 25-28 °C. For recording, slices were 
transferred to a recording chamber continuously superfused with ACSF (1-2 ml 
min! at 30-32 °C). Whole-cell recordings were performed in both current- and 
voltage-clamp configurations. Recording pipettes (3-5 MQ of resistance) contained 
the following solution (in mM): 124 KHPO,, 10 NaCl, 2 MgCh, 0.5 EGTA, 10 
HEPES, 2 Naz-ATP, 0.03 Na-GTP (pH 7.2, adjusted with KOH). Signals were 
sampled at 10 kHz, filtered at 2kHz, and acquired using a MultiClamp 700A 
amplifier and pClamp 10 software (Molecular Devices). 

Amperometric recording. Amperometry was used to detect the evoked catechol- 
amine exocytosis from single cells'**'. Carbon-fibre microelectrodes were fabri- 
cated from 5 um carbon fibres (Goodfellow), inserted in a 1.2 X 0.68 mm glass 
capillary (A-M system, Sequim) and pulled with a PE-22 micropipette puller 
(Narishige). Electrodes were sealed by dipping in Epoxy resin (Epo-Tek 301, 
Epoxy Technology) and cured at 100°C for 24h. They were backfilled with 3 M 
KCl and trimmed to obtain a basal current between 140 and 180 nA. The electro- 
des’ responses were tested by cyclic voltammetry and those with unstable cyclic 
voltammograms, when tested in a solution of 10 4M dopamine, were rejected. A 
voltage was applied to the carbon fibre using an EPC10 USB patch-clamp amplifier 
(HEKA Elektronik). The signal was low-pass filtered at 10 kHz using a 4-pole 
Bessel filter, digitalized at 50 kHz and digitally refiltered at 1-1,000 Hz. The latter 
resulted in slightly longer responses, but significantly improved the visualization of 
secretion events that was the only aim of these experiments. The electrode was 
positioned adjacent to individual cells and lowered to approach the somatoden- 
dritic domains of iDA cells*, using an Olympus BX51WI microscope with x40 
water immersion objective. To increase the signal-to-noise ratio, cells were pre- 
treated with 100 1M L-DOPA (Sigma-Aldrich) for 30 min although we were able 
to resolve single spike-like release events in two untreated cells. The experiments 
consisted of current recordings at +750 mV during a brief baseline period, during 
which cells were perfused with standard external medium containing 5 mM K*. It 
was then exchanged for a stimulation solution (25 mM K*), and amperometric 
signals were recorded for a further period of 7 min. Catecholamine secretion was 
apparent as discrete spike-like events, each corresponding to vesicular catechola- 
mine release. Most events were detected during 2 min after 25 mM K* stimulation, 
but occasional events were observed also during baseline recordings. No vesicular 
release of dopamine was recorded at the electrode placed adjacent to a cell when the 
applied potential was 0 mV or —750 mV, or at +750 mV when the electrode was 
placed remotely from cells. 

FM4-64 assay. FM4-64 dye uptake experiments were performed as previously 
reported®*. Briefly, 21 days in vitro TH-GFP* iDA cells were stimulated for 1 min 
with 55mM KCl, in the presence of FM4-64 (10 uM). After FM4-64 loading, 
neuronal cells were washed and perfused for 10 min with warmed Krebs buffer 
(37 °C) supplemented with TTX (1 4M) and CNQX (10 1M). After live fluorescent 
FM4-64 signals were acquired, cells were fixed and immonostained for TH and 
SYT1. 
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Electron microscopy. For ultrastructural immunocytochemistry, 21 days in vitro 
infected MEFs were fixed in 2% glutaraldehyde in PBS, washed in PBS, postfixed in 
2% OsO, in PBS, and embedded in Epon. Ultrathin sections prepared from these 
samples were analysed with electron microscope (H-7000; Hitachi). 
Transplantation surgery. After 4 days of infection, TH-GFP MEFs were trypsinized 
and resuspended at 2 X 10° cells jl’ in fresh prepared Krebs buffer containing the 
following (in mM): 126 NaCl, 2.5 KCl, 1.2 NaH,PO,, 1.2 MgCh, 2.1 CaCl, 11 
glucose, 4.2 NaHCO3;, 1 HEPES, and 1% vital dye Fast Green. P1 mice pups were 
anaesthetized by hypothermia (4 min) and fixed to a support using band-aid. The 
skin and the skull overlying the lateral ventricle were opened over about 2 mm using 
an ophthalmic scalpel. Subsequently, the mouse was placed in a stereotaxic rig (Kopff) 
under a Hamilton syringe containing 2 1 of cell suspension. The syringe was placed 
over the incision, positioned at the level of the skull, then lowered into the lumen of 
the right lateral ventricle (LV; 2.5 mm) or in the somatosensory cortex (1.5 mm) and 
cell solution was injected. Mice were left on a 37°C heating blanket for several 
minutes after surgical manipulation to avoid fatal hypothermia. 

Stereological analysis. Three mice transplanted with reprogrammed cells were 
used for stereological analysis. Three weeks after transplantation mice were anaes- 
thetized and killed by transcardiac perfusion with PBS followed by 4% para- 
formaldehyde. Brains were cryoprotected through incubation in an ice-cold 
solution of 30% sucrose in PBS and cut in coronal 40-m-thick cryostat sections. 
From these sections, one systematic random series of sections was stained for GFP, 
so that sections were spaced at 7 section intervals (total of 16 sections per mouse). 
GFP immunoperoxidase staining was performed as described elsewere**. Cells 
were quantified using the assistance of the Stereo Investigator v 3.0 software 
(MicroBrightField) and a personal computer running the software connected to 
a colour video camera mounted on a Leica microscope****. The motorized stage of 
the microscope allowed precise and well-defined movements along the x-, y- and 
z-axes. Images were first acquired with a CCD-IRIS colour video camera and the 
cerebral hemispheres were interactively delineated at low magnification on a video 
image of the section. Counting of cells was performed manually on every seventh 
section using a X40 lens. To estimate the total number of GFP positive cells the 
total number of neurons counted on the sections was multiplied by seven. 
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Neurogenic transcription factors and evolutionarily conserved sig- 
nalling pathways have been found to be instrumental in the forma- 
tion of neurons!”. However, the instructive role of microRNAs 
(miRNAs) in neurogenesis remains unexplored. We recently dis- 
covered that miR-9* and miR-124 instruct compositional changes 
of SWI/SNF-like BAF chromatin-remodelling complexes, a pro- 
cess important for neuronal differentiation and function**. 
Nearing mitotic exit of neural progenitors, miR-9* and miR-124 
repress the BAF53a subunit of the neural-progenitor (np)BAF 
chromatin-remodelling complex. After mitotic exit, BAF53a is 
replaced by BAF53b, and BAF45a by BAF45b and BAF45c, which 
are then incorporated into neuron-specific (n)BAF complexes 
essential for post-mitotic functions*. Because miR-9/9* and miR- 
124 also control multiple genes regulating neuronal differenti- 
ation and function®’"’, we proposed that these miRNAs might 
contribute to neuronal fates. Here we show that expression of 
miR-9/9* and miR-124 (miR-9/9*-124) in human fibroblasts 
induces their conversion into neurons, a process facilitated by 
NEUROD2Z. Further addition of neurogenic transcription factors 
ASCLI1 and MYTIL enhances the rate of conversion and the 
maturation of the converted neurons, whereas expression of these 
transcription factors alone without miR-9/9*-124 was ineffective. 
These studies indicate that the genetic circuitry involving miR-9/ 
9*-124 can have an instructive role in neural fate determination. 
During the course of exploring the roles of miR-9/9* and miR-124 
(ref. 5), we noted that these miRNAs induced neuronal morphologies 
in cultured cells. To explore this effect in greater detail, we prepared a 
single lentiviral vector that expresses both precursors of miR-9/9* and 
miR-124 along with a turbo red fluorescent protein (tRFP) marker, 
and infected human neonatal foreskin fibroblasts (Supplementary Fig. 1). 
The fibroblast culture was free of neural progenitors, keratinocytes or 
melanocytes (Supplementary Figs 2-4). Remarkably, fibroblasts expres- 
sing miR-9/9*-124 showed a rapid reduction in proliferation, displayed 
neuron-like morphologies (Supplementary Fig. 5) and expressed 
MAP2, a marker of post-mitotic neurons, within 4 weeks after infection 
(Fig. 1a, left). This was owing to synergism between miR-9/9* and miR- 
124, as expressing these miRNAs separately did not lead to the appear- 
ance of MAP2-positive cells (Supplementary Fig. 6). In light of the low 
percentage of MAP2-positive cells obtained with miRNAs only (less 
than 5%, Fig. 1b), we began adding neurogenic transcription factors 
and found that NEUROD2 (refs 14-17) was most effective at increasing 
the conversion frequency (Supplementary Fig. 7). We estimate that 
~50% of these cells have acquired neuronal fates as indicated by 
MAP2 expression 30 days after infection (Fig. la, right, b). However, 
because cells detached, remained uninfected or died during the con- 
version process, a conservative estimate is that ~5% of the starting 
cells became neurons. Importantly, neither NEUROD2 alone nor 
non-specific miRNA (miR-NS) could convert fibroblasts into neurons 


(Fig. 1b), demonstrating the essential role of miR-9/9*-124 in this pro- 
cess. Synergism between miR-9/9* and miR-124 seemed to be crucial: 
expressing miR-9/9* and miR-124 individually with NEUROD2 failed 
to produce MAP2-positive cells (Supplementary Fig. 6). Using EdU- 
incorporation, we found that miR-9/9*-124-infected fibroblasts had 
exited the cell cycle 1 week after infection (Supplementary Fig. 8), con- 
sistent with the anti-proliferative role of these miRNAs’. Lastly, immu- 
nostains indicated that the induced neurons expressed SCN 1a, a key 
contributor to neuronal excitability, as well as synapsin 1 and NMDA 
receptor 1 (Fig. 1c). 

Using whole-cell patch recording, we found that injecting depolariz- 
ing current in induced neurons (cultured up to 8 weeks) could con- 
sistently trigger single action potentials and in some cases, repetitive 
firing (Fig. 1d). Moreover, their resting membrane potential 
(—34.1 + 1.7 mV; Supplementary Fig. 9) was significantly more nega- 
tive than that of control fibroblasts (—20.4 + 0.6 mV, n = 4). Applying 
a series of voltage steps to the induced cells evoked large inward 
currents closely followed by outward currents, which were not 
observed in the fibroblasts (Supplementary Fig. 10). Importantly, add- 
ing 1 uM tetrodotoxin (TTX) completely and reversibly blocked the 
initial inward current, confirming that the current was due to voltage- 
gated sodium channels (Fig. le), as would be expected from the 
current-voltage (I-V) curve of inward currents (Fig. 1f, left). The 
I-V curve of outward currents showed the characteristics of voltage- 
gated potassium channels in neurons (Fig. 1f, right). Moreover, some 
of these cells exhibited postsynaptic currents, which could be reversibly 
blocked by 2,3-dihydroxy-6-nitro-7-sulfamoyl-benzo[f]quinoxaline- 
2,3-dione (NBQX) and 2-amino-5-phosphonopentanoic acid (APV) 
(Supplementary Fig. 11). 

We examined the ability of cells converted by miR-9/9*-124- 
NEUROD2 to elicit a stimulation-dependent calcium influx using 
the calcium indicator Fluo2. Field stimulation triggered calcium influx 
that could be abolished by adding TTX (Fig. 1g) or 200 pM Cd** 
(Supplementary Fig. 12), demonstrating the ability of converted cells 
to support activity-dependent Ca’ influx through voltage-gated Ca** 
channels without any requirement for a pre-pulse. Activity-dependent 
uptake and release of the lipophilic dye FM1-43 was used to evaluate 
the ability to form functional presynaptic terminals'*. We found 
that the induced cells were able to take up and release FM dyes in a 
stimulation-dependent and Ca**-dependent manner (Fig. 1h). 

Because the miR-9/9*-124-NEUROD2-induced cells only occa- 
sionally showed repetitive action potentials, we sought to optimize 
the maturation of the cells by introducing additional neurogenic factors. 
Because ASCL1 and MYTIL were previously shown to be important for 
converting mouse embryonic fibroblasts into functionally mature neu- 
rons’, we expressed miR-9/9*-124 together with NEUROD2, ASCL1 
and MYTIL (DAM). We found that the miR-9/9*-124-DAM- 
converted cells were positive for MAP2 expression in approximately 
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Figure 1 | miRNA-induced transformation of human fibroblasts. a, MAP2 
expression in miR-9/9*-124 only (left) and miR-9/9*-124-NEUROD2 (right) 
converted cells. DAPI, 4’,6-diamidino-2-phenylindole. Scale bar, 20 um. 

b, Quantification of MAP2-positive cells with processes at least three times the 
length of the cell body from ten random fields. The graph represents the 
percentage of MAP2-positive cells over DAPI-positive cells. miR-9/9*-124 
only: n = 558; miR-9/9*-124-NEUROD2: n = 658 cells. The error bars are 
s.e.m. MAP2 signal was undetectable in fibroblasts infected with miR-NS, with 
or without NEUROD2. Scale bar, 20 um. c, Expression of SCN1a, synapsin 1 
and NMDARI in miR-9/9*-124-NEUROD2-converted cells. Scale bar, 20 um. 
d, Representative traces of action potentials recorded in current clamp in miR- 
9/9*-124-NEUROD2-converted cells. Twelve out of 22 cells showed single 
action potentials and 2 cells showed repetitive firing. e, A representative 
example of a series of voltage steps applied to an miR-9/9*-124-NEUROD2- 
induced neuron held at —70 mV. An inward current was observed, blocked by 
1 uM TTX, and reversed after TTX washout (n = 6). f, I-V curve for the peak 


100° | 
os 
2s 
a | 
aS | 
a. 
Bx 50 | 
ad 
2 | 
=8 Ol. ] 
miR-NS miR-9/9* 
DAM -124-DAM 


20ms_ 


EPSC 
-- s 
eG . 100 ms 
-_= x 
at ill alate _|% 
oO 
10 ms“ 20 ms 


LETTER 


> 
Ss 
Is oO 
| | ! 100 ms 
+20 pA 
rs 
e TTX washout 
Baseline TTX (1 uM) washou 
‘ & é 
So 
10ms ‘6 
f S I-V curve (Na*) I-V curve (K*) 
50.25 1 
5 0 +——— 
Boos 0.5 
a i 
5 -0.25 
Z -100-50 0 50100 -100-50 0 50 
Voltage (mV) Voltage (mV) 
Uptake Release 
ee © 1,200 APs 10 Hz 
= 1.0 ia 
= s 1.0 
5 50.8 
205: o” 
g 20.6 
a ® 
0 50.4 
2 O1mM2mM «& Q 50 100150200 
[Ca?*] Time (s) 


inward (left) and outward (right) currents. g, An example of Ca”* influx in 
induced neurons as measured by Fluo2-AM imaging. Images show the peak 
Fluo2-AM signal on stimulation before and during TTX application, and after 
TTX washout, respectively. The graph represents the changes in Fluo2 signal 
over time (circles, no TTX; triangles, with TTX). AP, action potential. Field 
stimulation is indicated by the black bar. Scale bar, 2 um. h, An example of 
vesicle recycling measured by FM1-43 imaging in induced neurons. Top 
diagram illustrates the protocol of FM uptake or release experiments. The 
images represent the typical FM1-43 dye uptake signal (middle) in a converted 
cell marked by tRFP (left). The left graph shows the measurement of FM1-43 
loading in 2mM Ca**, which was significantly reduced in low Ca?* 
concentration (0.1 mM). The right graph quantifies FM1-43 release (de- 
staining) during stimulations. FM1-43 signal was measured from n > 600 
boutons (arrows indicate examples of boutons) from 4 cultures. All error bars 
are s.e.m. In some cases, the s.e.m. is too small to be resolved. Scale bar, 4 um. 


Figure 2 | Additional neural factors enhance the conversion to neurons. 

a, MAP2 (left) and B-III tubulin (right) immunostaining of miR-9/9*-124- 
DAM-converted cells. Scale bar, 40 um. The graph represents the percentage of 
MAP2-positive cells over DAPI-positive cells. The error bars are s.e.m.; 1 = 150 
cells. b, A representative current clamp recording from a cell with typical 
neuronal morphology (see inset; scale bar, 50 um). Voltage deflections were 
elicited by somatic current injections of various amplitudes (A = 5 pA). c, A 
representative voltage clamp recording of the net current at various membrane 
potentials (—40 to +20 mV, AV= 10 mV, Vioig = -90 mV). d, A 
representative trace of spontaneously active cells recorded in cell-attached 
mode. e, A representative trace demonstrating spontaneous EPSCs. 

f, Representative traces of evoked postsynaptic currents (left, EPSC; right, 
IPSC) obtained in response to local field stimulation with single current pulses 
(1 ms) of various amplitudes (left, 0.25 and 0.3 mA; right, 0.3 and 0.4 mA) at 
different membrane holding potentials (left, —70 mV; right, 0 mV). The arrows 
indicate the time when stimulation was applied. Stimulation artefacts were 
eliminated for clarity. 
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80% of the cells remaining on the coverslips (Fig. 2a, representing 
~10% of the initially plated cells), and showed extensive neurite out- 
growth as illustrated by B-III tubulin staining (Fig. 2a, right). In addi- 
tion, miR-9/9*-124-DAM resulted in complete exit from cell cycle as 
assayed by EdU pulsing for 4 days (0/176 positive) whereas nearly all 
control cells were positive for EdU (97/107). Importantly, DAM factors 
with miRNA-NS failed to produce neurons as assayed by MAP2 stain- 
ing (Supplementary Fig. 13). About 80% of the miR-9/9*-124-DAM- 
converted cells were able to fire repetitive action potentials in response 
to current injections, and showed typical sodium and potassium 
currents during voltage clamp depolarizations (Fig. 2b-c and Sup- 
plementary Figs 9 and 14). Among recorded cells, we also observed 
spontaneously active cells (2/21) (Fig. 2d). Spontaneous excitatory 
postsynaptic currents (EPSCs) were seen in 10 out of 14 induced cells 
(Fig. 2e) without co-cultured primary neurons (Supplementary Figs 9 
and 15a). Furthermore, the induced cells exhibited evoked EPSCs and 
inhibitory postsynaptic currents (IPSCs) in response to local stimu- 
lation (Fig. 2f). Importantly, neuronal identity was stable after the 
removal of exogenous expression of miR-9/9*-124 and DAM after 3 
weeks of induction, as they still stained positive for SV2 and synapsin 1 
(Supplementary Fig. 16). 

We next performed single-cell analysis to characterize the types of 
neurons in miR-9/9*-124-DAM-induced cells. From randomly col- 
lected single cells 4 weeks post-infection, we analysed a total of 45 
induced neurons (based on MAPT and TUBB3 co-expression) for 
genes expressed in different types of neurons. We found that most 
induced cells were positive for genes expressed in cortical layers 
(Fig. 3a and Supplementary Fig. 17). Interestingly, we did not detect 
a peripheral nervous system marker (peripherin) or dopaminergic/ 
noradrenergic markers (DDC, TH). Striatal markers (DLX5 and 
DARPP32 (also known as PPP1R1B)), the serotonergic marker 5HT- 
2C (also known as HTR2C), and cerebellar genes (PCP2, GRP, TPM2) 
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Figure 3 | Characterization of induced neurons and nBAF subunit 
expression. a, Multiplex quantitative polymerase chain reaction (qPCR) of 45 
induced neurons for genes of specific brain regions and cell types. Cb, 
cerebellum; Dpa, dopaminergic; Exc, excitatory; Inh, inhibitory; Mb, midbrain; 
P, peripheral nervous system; S, serotonergic; Str, striatum; Ubq, ubiquitous; 
Vg, voltage gated. b, Fibroblasts stained negative for neuron-specific subunits of 
BAF complexes (top) whereas induced neurons expressed BAF45b, BAF45c 
and BAF53b (bottom). Scale bar, 20 um. 
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were expressed in only a small number of cells. The miR-9/9*- 
124-DAM-induced cells seemed to be heterogeneously excitatory 
(VGLUTI1 (also known as SLCI7A7) and SLC1A2) and inhibitory 
(GAD67 (also known as GAD1) and DLX1) (Fig. 3a), which is further 
supported by immunostains of VGLUT1 and GABA (Supplementary 
Fig. 18). Moreover, by 4 weeks post-infection, the induced cells already 
expressed genes important for synaptic structure and function, includ- 
ing SYN1, BSN, PCLO and SHANK3 (Fig. 3a). 

miR-9* and miR-124 target separate sites in the 3’ untranslated 
region (UTR) of BAF53a, a subunit of BAF complexes resulting in 
repression of BAF53a and activation of BAF53b, which is involved 
in an evolutionarily conserved program of neural development”. 
Remarkably, we found that all of the nBAF subunits (BAF53b, 45b 
and 45c) were induced in the converted cells (Fig. 3b). In embryonic 
stem cells, BAF complexes function across the genome at several 
thousand sites to control placement of polycomb repressive complex 
2 and the H3K27me3 repressive mark*'”*. Hence, one role of the 
miRNAs might be to induce stable epigenetic changes involving 
polycomb function across the genome. 

In addition to BAF53a, miR-9/9* and miR-124 also target other 
genes essential for neurogenesis and neuronal functions” including 
components of the REST complex such as REST and CoREST*??**, 
and PTBP-1 (ref. 7). We found that human fibroblasts expressed 
BAF53a, which could be repressed by miR-9/9*-124 (Supplementary 
Fig. 19). However, prolonging the expression of BAF53a only incom- 
pletely blocked neuronal conversion of fibroblasts, as assayed by MAP2 
staining (data not shown). Prolonging the expression of REST, CoREST 
or PTBP1 yielded similar results (data not shown). These findings 
indicate that in inducing cell fate transformations, the miRNAs miR- 
9/9* and miR-124 operate programmatically on multiple targets. 


Figure 4 | Conversion of adult fibroblasts by miR-9/9*-124-DAM. 

a, Immunostaining of B-III tubulin (left), MAP2 (middle) and neurofilament 
(right) in dermal fibroblasts of a 30-year-old individual converted by miR-9/9*- 
124-DAM. Scale bar, 20 1m. b, A representative current clamp recording. 
Voltage deflections were elicited by somatic current injections of various 
amplitudes (A = 2 pA). ¢, A representative voltage clamp recording of the net 
current at various membrane potentials (—40 to +20 mV, AV = 10 mV, 
Viola = —90 mV). d, A representative trace of spontaneous EPSCs. 

e, Representative traces of evoked postsynaptic currents obtained from 
converted adult cells (see inset; scale bar, 50 1m) in response to local field 
stimulation with single current pulses (1 ms) of various amplitudes (left, 0.2 and 
0.25 mA; right, 0.4 and 0.5 mA). Evoked EPSPs and IPSCs were recorded at 
—70mV and +30 mV in the presence of picrotoxin and NBQX/APV, 
respectively. S, stimulator. 
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Lastly, we asked whether our approach could be effective in con- 
verting adult fibroblasts. We found that adult human dermal fibro- 
blasts (from a 30-year-old female) could be converted into neurons 
(Fig. 4a), albeit more slowly. Recordings from adult cells converted by 
miR-9/9*-124-DAM 6 weeks after infection showed that they were 
able to generate action potentials (Fig. 4b). They also demonstrated 
typical voltage-gated sodium and potassium currents (Fig. 4c), spon- 
taneous EPSCs (Fig. 4d and Supplementary Fig. 15b) and evoked 
EPSCs and IPSCs (Fig. 4e) without co-cultured primary neurons. 

Our studies show that activating a neural developmental regulatory 
circuit involving miRNAs in human fibroblasts can surprisingly induce 
their conversion into neurons, indicating an instructive role for this 
circuitry in human neurogenesis. In our study, neurogenic transcrip- 
tion factors, delivered either singly (NEUROD2), or in combination 
(NEUROD2, ASCL1 and MTYL1), seem to function synergistically with 
the neurogenic activities of miR-9/9*-124. This raises the possibility of 
inducing various types of neurons using miR-9/9*-124 together with 
different sets of transcriptional factors. 


METHODS SUMMARY 


Transduction of human fibroblasts. A synthetic cluster of miR-9/9* and miR-124 
validated previously to express miR-9* and miR-124 (ref. 5) was inserted down- 
stream of tRFP in the pLemiR lentiviral construct carrying a puromycin selection 
cassette (Open Biosystems) driven by either a CMV promoter or a doxycycline- 
responsive promoter. A non-silencing sequence, which produces miRNA-NS, was 
used as a control (Open Biosystems). Each transcription factor was cloned down- 
stream of the EFlo promoter in a separate lentiviral construct. Typically, infected 
human fibroblasts were maintained in fibroblast media for 3-4 days before selection 
with appropriate antibiotics in Neuronal Media (ScienCell) supplemented with VPA 
(1 mM) and basic FGF (20 ngml ’). dbcAMP (500 tM) was added 10 days later to 
enhance cell survival. Human BDNF and NT3 (10 ng ml |; Peprotech) were added 
to the media after 3-4 weeks. Media were changed every 4 days. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Plasmid construction and viral preparation. We have previously constructed a 
synthetic cluster that expresses the precursors of both miR-9/9* (NCBI and 
miRBASE accession numbers MIMAT0000441 and MIMAT0000442) and miR- 
124 (accession number MIMAT0000422) and validated its ability to generate 
mature miRNAs of both. Here we cloned this cluster downstream of a tRFP 
marker into pLemiR (Open Biosystems), driven by either a CMV promoter or a 
doxycycline-responsive promoter. A non-silencing sequence in pLemiR (miR-NS, 
which produces non-specific miRNA) was used as a control (Open Biosystems). 
cDNA of each neural transcription factor used in this study was cloned down- 
stream of the EFlo promoter in a separate lentiviral construct with blasticidin or 
neomycin selection. For doxycycline experiments, human fibroblasts were first 
infected with a lentiviral construct expressing rtTA under the EFla promoter and 
stably selected with hygromycin. Infectious lentiviruses were collected 36-60h 
after transfection of Lenti-X 293T cells (Clontech) with appropriate amounts of 
lentiviral vectors, psPAX2 and pMD2.G (Addgene) using Fugene HD (Roche). 
Cell culture. All fibroblast cultures (human neonatal foreskin fibroblasts (ATCC, 
PCS-201-010) and adult dermal fibroblast (ScienCell)) were maintained in fibroblast 
media (Dulbecco’s Modified Eagle Medium; Invitrogen) containing 10% fetal bovine 
serum (FBS; Omega Scientific), B-mercaptoethanol (Sigma-Aldrich), non-essential 
amino acids, sodium pyruvate, GlutaMAX, and penicillin/streptomycin (all from 
Invitrogen). The day before lentiviral infection, human fibroblasts were seeded onto 
gelatin-coated 24-well tissue culture dishes (MidSci). Next day, cells were infected 
with filtered viral supernatants in the presence of polybrene (8 jig ml_') overnight. 
Fresh media were then replaced for 2-3 days with appropriate antibiotics to select for 
infected cells. Four days after infection, the media was changed to Neuronal Media 
(ScienCell) supplemented with VPA (1 mM), basic FGF (20 ng ml ') with media 
changes every 4 days. Our experiments to optimize the conversion protocol indicated 
that VPA and bFGF are beneficial for conversion for the first 2-3 weeks. We also 
added dbcAMP (0.5 mM, Sigma) after 2 weeks to the media as we found it enhanced 
cell survival. Human BDNF and NT3 (10 ngml ', Peprotech) were added to the 
media after 3-4 weeks to promote the survival of the induced cells. To facilitate 
immunostaining and electrophysiological studies in some experiments, cells were 
trypsinized (0.05% trypsin, Invitrogen) at about 10 days after infection and re-plated 
onto poly-ornithine (Sigma-Aldrich)/laminin (Roche)/fibronectin (Sigma-Aldrich)- 
coated glass coverslips. 

Immunofluorescence. The following antibodies were used for the immunofluor- 
escence studies: mouse anti- MAP2 (Sigma-Aldrich, 1:750), chicken anti-MAP2 
(Abcam, 1:30,000), mouse anti-fB-III tubulin (Covance, 1:30,000), rabbit anti- 
VGLUTI (Synaptic Systems, 1:2,000), rabbit anti-SCN1a (Abcam, 1:1,000), rabbit 
anti-NMDARI (Abcam, 1:2,000), rabbit anti-neurofilament 200 (Sigma-Aldrich, 
1:2,000), mouse anti-SV2 (Developmental Studies Hybridoma Bank, 1:100), rabbit 
anti-GABA (Sigma, 1:2,000) and rabbit anti-synapsin1 (Cell Signaling, 1:200). 
Antibodies against BAF subunits were generated in our laboratory and used as 
the following concentrations: BAF45b (1:250), BAF45c (1:1,000) and BAF53b 
(1:500). The secondary antibodies were goat anti-rabbit or mouse IgG conjugated 
with Alexa-488, -568 or -647 (Invitrogen). For SCNla and BAF53b staining, 
biotinylated secondary antibodies were detected using TSA amplification kit 
(Invitrogen). EdU incorporation assay was performed according to the manufac- 
turer’s protocols (Invitrogen). Images were captured using a Leica DM5000B 
microscope with Leica Application Suite (LAS) Advanced Fluorescence 1.8.0 
and a Leica DMI4000B microscope with LAS v.2.8.1. 

Electrophysiology. Recordings were performed on fibroblasts 5-8 weeks after 
infection for both miR-9/9*-124-NEUROD2 and miR-9/9*-124-DAM converted 
cells, which were co-cultured with mouse glia. Data were acquired in whole-cell 
mode at room temperature (25 °C) using an Axopatch 200B amplifier (Molecular 
Devices) or EPC 10 amplifier (HEKA) and sampled at 5 kHz with a 2 kHz low-pass 
filter. Recording pipette resistance was 2-6 MQ. Intrinsic neuronal properties were 
studied using the following solutions (in mM): extracellular, 140 NaCl, 2.5 KCl, 2.5 
CaCl, 2 MgCl,, 1 NaH»PO,, 20 glucose, 10 HEPES, pH 7.4; intracellular, 120 
KGluc, 20 KCl, 4 NaCl, 4 Mg,ATP, 0.3 NaGTP, 10 Na2PCr, 0.5 EGTA, 10 HEPES, 
pH 7.25. Synaptic activity was measured using the same extracellular solution, 
supplemented with 50 uM APV, and the following intracellular solution (in mM): 


135 CsMeS, 5 CsCl, 10 HEPES, 0.5 EGTA, 1 MgCl, 4 MgoATP, 0.4 NaGTP, 5 QX- 
314, pH 7.4 CsOH. Correspondingly, EPSCs were measured at —70 mV whereas 
IPCSs were measured at either 0 or +30 mV (with 10 1M NBQX and 50 uM APV, 
Eq, approximately —79 mV). Evoked postsynaptic currents were elicited by a 
stimulating electrode (CBAEC75, FHC) positioned 100-150 um aside from the 
cell soma through which brief (1 ms) unipolar current pulses of various amplitudes 
(0.1-0.9 mA, A = 0.05-0.1 mA) were applied. Recordings were filtered at 2 kHz 
and digitally sampled at 10 kHz. Data were collected and initially analysed with 
Clampfit 10 or the Patchmaster software (HEKA). Further analysis was performed 
using IgorPRO and MS Excel. Series resistance was left uncompensated owing to 
the fragility of the cells, but was corrected in the current clamp calculations. The 
liquid junction potential was calculated to be 15 mV (Clampfit) and corrected in 
calculating resting membrane potentials according to published methods”. 
Calcium imaging. Cells were loaded with Fluo2-AM (5 1M, TEFLABS) in Tyrode 
solution (150 mM NaCl, 4mM KCl, 2mM CaCl, 2mM MgCh, 10mM glucose, 
10 mM HEPES, 310-315 mOsm, with pH at 7.35) for 30 min in a 37 °C incubator. 
After two washes with Tyrode, cells were imaged using a filter cube (excitation 
470 + 20nm and emission 535 + 50 nm). In some cases, 1 uM TTX or 200 uM 
CdCl, was superfused. All images were converted to TIFF files and analysed 
off-line with Metamorph or ImageJ. All error bars represent s.e.m. For analysis 
of FM-positive puncta, 1.3 um in diameter regions of interest were used to cover 
functional boutons. Photobleaching was corrected by fitting the pre-stimulation 
baseline to a linear curve. 

FM1-43 imaging. Cells were superfused with Tyrode solution. Switching of super- 
fusion solution was carried out with a precision of <2s. Solutions contained 
10 uM NBQX and 50 uM b-APV (Tocris Bioscience) to prevent possible recurrent 
activity and synaptic plasticity. All experiments were performed at room temper- 
ature and neurons were stimulated with platinum electrodes. Putative presynaptic 
boutons were stained with 8 1M FM1-43 (Molecular Probes) using field stimulation 
for 120s at 10 Hz, followed by 60 s without stimulation to maximize the loading. In 
some experiments, 0.1 mM CaCl, was used to test the calcium dependency. After 
10 min of washing with dye-free Tyrode’s solution, individual boutons were de- 
stained by field stimulation. FM1-43 dyes were excited at 470nm (D470-40x; 
Chroma) and their emission was collected at 535 nm (535/50m). tRFP was excited 
at 535 nm (535/50ex) and its emission was collected at 580 nm (580 Ip). All images 
were taken at a frame rate of 1-3 Hz by a Cascade 512B camera. 

Single-cell qPCR. Single cells were collected by clone FACS sorting using a BD 
influx sorter (BD Biosciences) in 10 ul of a pre-amplification mix containing 
40 nM of all primers for genes of interest, and the following components of the 
CellsDirect One-Step qRT-PCR Kit (Invitrogen): 2 Reaction Mix, SuperScript 
II RT/Platinum Taq Mix. After sorting, samples were reverse transcribed and pre- 
amplified for 18 cycles. Pre-amplified samples were diluted (3) with TE buffer 
and stored at —20°C. Sample and assay (primer pairs) preparation for 96.96 
Fluidigm Dynamic arrays was done according to the manufacturer’s recom- 
mendation. Briefly, sample was mixed with 20 DNA binding dye sample loading 
reagent (Fluidigm), 20x EvaGreen (Biotium) and TaqMan Gene Expression 
Master Mix (Applied Biosystems). Assays were mixed with 2X assay loading 
reagent (Fluidigm) and TE to a final concentration of 5 1M. The 96.96 Fluidigm 
Dynamic Arrays (Fluidigm) were primed and loaded on an IFC Controller HX 
(Fluidigm) and qPCR experiments were run on a Biomark System for Genetic 
Anaylsis (Fluidigm). Data were collected and analysed using the Fluidigm Real- 
Time PCR Analysis software (v.2.1.3 and v.3.0.2). Melting curves were used to 
determine specificity of each reaction. Further data analysis was performed using 
Microsoft Excel. In addition to collected single-cell material, every experiment 
contained samples for four standard dilutions of a mixed human cDNA library. 
The collected cells were confirmed based on RSG18 (18S small ribosomal subunit) 
and GAPDH co-expression. Of these, induced neurons were identified by co- 
expression of two general neuronal genes MAPT and TUBB3 for further analysis 
of genes specific to brain regions and cell types. 


27. Barry, P. H. JPCalc, a software package for calculating liquid junction potential 
corrections in patch-clamp, intracellular, epithelial and bilayer measurements and 
for correcting junction potential measurements. J. Neurosci. Methods 51, 107-116 
(1994). 
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Crystal structure of the human centromeric 
nucleosome containing CENP-A 
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In eukaryotes, accurate chromosome segregation during mitosis 
and meiosis is coordinated by kinetochores, which are unique 
chromosomal sites for microtubule attachment’*. Centromeres 
specify the kinetochore formation sites on individual chromo- 
somes, and are epigenetically marked by the assembly of nucleo- 
somes containing the centromere-specific histone H3 variant, 
CENP-A*"*. Although the underlying mechanism is unclear, 
centromere inheritance is probably dictated by the architecture 
of the centromeric nucleosome. Here we report the crystal struc- 
ture of the human centromeric nucleosome containing CENP-A 
and its cognate a-satellite DNA derivative (147 base pairs). In the 
human CENP-A nucleosome, the DNA is wrapped around the 
histone octamer, consisting of two each of histones H2A, H2B, 
H4 and CENP-A, in a left-handed orientation. However, unlike 
the canonical H3 nucleosome, only the central 121 base pairs of 
the DNA are visible. The thirteen base pairs from both ends of the 
DNA are invisible in the crystal structure, and the aN helix of 
CENP-A is shorter than that of H3, which is known to be important 
for the orientation of the DNA ends in the canonical H3 nucleo- 
some’. A structural comparison of the CENP-A and H3 nucleo- 
somes revealed that CENP-A contains two extra amino acid 
residues (Arg 80 and Gly 81) in the loop 1 region, which is com- 
pletely exposed to the solvent. Mutations of the CENP-A loop 1 
residues reduced CENP-A retention at the centromeres in human 


Figure 1 | Crystal structure of the human CENP-A nucleosome. a-c, Three 
views of the CENP-A nucleosome structure are presented. a, View in the axis of 
the DNA supercoil; b, c, views from the side of the DNA supercoil. Two CENP- 


cells. Therefore, the CENP-A loop 1 may function in stabilizing 
the centromeric chromatin containing CENP-A, possibly by pro- 
viding a binding site for trans-acting factors. The structure pro- 
vides the first atomic-resolution picture of the centromere-specific 
nucleosome. 

Octasome and hemisome models have been proposed for the 
CENP-A nucleosome architecture". In the octasome model, two each 
of histones H2A, H2B, H4 and CENP-A form a histone octamer, and 
about 150 base pairs of DNA are wrapped in a left-handed orientation 
around the histone octamer, as in the canonical H3 nucleosomes’*””. 
In the hemisome model, however, one each of histones H2A, H2B, H4 
and CENP-A'™! form a heterotypic tetramer, and the DNA is 
wrapped in a right-handed orientation around the tetramer”. 

To reveal the architecture of the CENP-A nucleosome, we prepared 
the nucleosome using bacterially expressed human histones H2A, H2B, 
H4 and CENP-A’*” (Supplementary Fig. 1) and a 147-base-pair 
palindromic DNA, which was designed from a human «-satellite 
sequence containing a binding site for the centromeric protein, CENP- 
B (Supplementary Fig. 2). The CENP-A nucleosome was reconstituted 
by a salt-dialysis method, and crystallized as described (Supplementary 
Methods). The structure was determined at 3.6 A resolution (Fig. 1, 
Supplementary Table 1). The crystal structure revealed a histone octamer 
containing two each of histones H2A, H2B, H4 and CENP-A, with the 
DNA wrapped in a left-handed orientation around the histone octamer 


aw - 
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A molecules are shown in magenta and green, respectively. The central 121- 
base-pair DNA region, which is visible in the CENP-A nucleosome structure, is 
shown in dark blue. 
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(Fig. 1). The overall structure is quite similar to those of previously solved 
nucleosomes containing other histone H3 variants'*”*”*. The left-handed 
DNA wrapping in the crystal structure was also observed in biochemical 
experiments. CENP-A oligonucleosomes assembled on circular plasmid 
DNA by a salt-dialysis method or in the presence of human histone 
chaperones, NAP1 and somatic nuclear autoantigenic sperm protein 
(sNASP)**”°, under physiological salt conditions, introduce left-handed 
(negative) supercoils in the DNA (Supplementary Figs 3 and 4). These 
results strongly indicate that the DNA wrapped in a left-handed orienta- 
tion is the predominant form in the human CENP-A nucleosome. 

In the free CENP-A-H4 structure reported previously”®, the CENP- 
A-CENP-A interface is substantially rotated relative to the H3-H3 
interface, indicating that the CENP-A-H4 tetramer may be more 
compact than the H3-H4 tetramer. However, this specific shape of 
the CENP-A-CENP-A interface may only be observed in the free 
CENP-A-H4 tetramer, because the CENP-A-CENP-A interface in 
the present structure was nearly identical to that of the H3-H3 inter- 
face in the canonical H3 nucleosome. Consistently, small-angle X-ray 
scattering (SAXS) measurements of CENP-A and H3 nucleosomes 
generated nearly identical SAXS curves and distance distribution func- 
tions (Supplementary Fig. 5). These results indicate that the global 
structures of the CENP-A and H3 nucleosomes are quite similar. In 
addition, a compact CENP-A nucleosome model containing 147 base- 
pair DNA constructed from the free H3-H4 tetramer structure*® 
generated a significantly different SAXS curve and distance distri- 
bution function from those of a CENP-A nucleosome model contain- 
ing 147 base pair DNA based on the CENP-A nucleosome crystal 
structure, which was very similar to the experimental data (Sup- 
plementary Fig. 6). Therefore, the CENP-A nucleosome structure is 
probably not compacted as was suggested previously”®. 

Although the overall structure of the CENP-A nucleosome resembles 
that of the canonical H3 nucleosome, in the CENP-A nucleosome struc- 
ture, only the central 121 base pairs of the DNA are visible, and thus the 
thirteen base pairs from both ends of the DNA are disordered (Fig. 1 and 
Supplementary Fig. 7). This observation indicates that the DNA regions 
at the entrance and exit of the CENP-A nucleosome lack a fixed con- 
formation. This is consistent with the previous report showing that the 
DNA segments at the entrance and exit of the CENP-A nucleosome are 
more flexible than those of the canonical H3 nucleosome****. We found 
that the CENP-A nucleosome can be reconstituted on a 121 base pair 
DNA (Supplementary Fig. 8). The CENP-A nucleosome induced super- 
coils into plasmid DNA less efficiently than the H3 nucleosome, which 
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also indicates that the DNA is partially unwrapped in the CENP-A 
nucleosomes (Supplementary Fig. 9). Furthermore, the exonuclease 
assay revealed that the DNA ends of the CENP-A nucleosomes were 
more susceptible to exonuclease III digestion, compared to those of the 
H3 nucleosome (Supplementary Fig. 10). Similar results were obtained 
with other DNA sequences without the CENP-B box (Supplementary 
Fig. 10). Moreover, the SAXS data showed that the maximum dimension 
Dyax (Dmax = 165 A) of the CENP-A nucleosome is slightly longer than 
that of the H3 nucleosome (Dax = 147 A), which probably reflects the 
unwrapped conformations of the thirteen base pairs from both ends of 
the DNA in the CENP-A nucleosome (Supplementary Fig. 5). 

This difference in the DNA end structures can be explained by the 
structural differences between the amino-terminal regions of CENP-A 
and H3 (Fig. 2a). Previous crystal structures of the canonical H3 
nucleosome revealed that the loop segment preceding the oN helix 
of H3 interacts with the ends of the DNA, and seems to stabilize their 
orientations (Fig. 2b)'**?”*. Thus, the length of the «N helix seems to be 
an important factor for maintaining the DNA orientation at the 
entrance and exit of nucleosomes. The &N helix of CENP-A is at least 
one helical turn shorter than that of H3, and the preceding region is 
completely disordered (Fig. 2b). The DNA conformations at the 
entrance and exit of the nucleosome are clearly related to the organ- 
ization of the nucleosomes”’, particularly within the heterochromatin, 
where the nucleosomes are presumably tightly packed. A previous 
study indicated that the centromeric DNA binding protein, CENP- 
B, binds efficiently to its recognition sequence”, if the sequence is 
located near the entrance or exit of the CENP-A nucleosome’®. The 
flexible nature of both ends of the DNA in the CENP-A nucleosome 
structure may be an inherent property that facilitates the binding of 
CENP-B, and possibly other centromeric DNA-binding proteins. 
Thus, the «N helix of CENP-A may have a key role in allowing the 
DNA to adopt the specific conformations at the entrance and exit of the 
nucleosome, which are not observed in the canonical H3 nucleosome. 

It is notable that CENP-A-DNA contacts may exist at the flexible 
DNA ends of the CENP-A nucleosome. In the exonuclease digestion 
experiments (Supplementary Fig. 9), we observed DNA fragments 
longer than 121 base pairs, suggesting that the CENP-A-DNA con- 
tacts may extend beyond 121 base pairs of DNA. Furthermore, the 
CENP-A nucleosome reconstitution efficiency was slightly lower with 
a 121-base-pair DNA, compared to that with a 147-base-pair DNA 
(Supplementary Fig. 8). A possible interpretation is that the CENP-A- 
DNA contacts extending beyond 121 base pairs of DNA may stabilize 
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Figure 2 | Structure of the DNA entrance and exit of the human CENP-A 
nucleosome. a, Secondary structure of CENP-A in the nucleosome. The 
sequences of human CENP-A and H3 are aligned with the secondary structure 
elements. b, Close-up views of the oN helices and the DNA edge regions of the 


CENP-A (left panel) and H3 (right panel) nucleosomes. The dashed line in the 
left panel shows the DNA region that is not visible in the crystal structure. The 
CENP-A and H3 molecules are shown in magenta and orange, respectively. 
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the CENP-A nucleosome, resulting in higher reconstitution efficiencies 
on a 147-base-pair DNA. 

A superimposition of CENP-A and H3 reveals a clear difference in 
the loop 1 region (residues Phe 78-Phe 84 of CENP-A), where CENP- 
A has two extra amino acid residues (Arg 80 and Gly 81) compared to 
H3 (Fig. 2a). The CENP-A loop 1 protrudes from the CENP-A nucleo- 
some, and the Arg 80 and Gly 81 residues are located at the tip of the 
loop (Fig. 3 and Supplementary Fig. 11). In the free CENP-A-H4 
tetramer structure, the loop 1 region is more flexible than the other 
CENP-A regions, as judged from the B-factors. By contrast, in the 
CENP-A nucleosome, the B-factors of the loop 1 region are similar 
to those of the other regions (Supplementary Fig. 12), indicating that 
CENP-A nucleosome formation may stabilize the loop 1 region. The 
tip of the loop 1 region is solvent-accessible (Fig. 3), and may function 
as a binding site for trans-acting factors that interact directly with the 
CENP-A nucleosome. To test the functional significance of the CENP- 
A Arg 80 and Gly 81 residues, we co-expressed the green fluorescent 
protein (GFP)-tagged CENP-A and the red fluorescent protein (RFP)- 
tagged CENP-A(del), in which the Arg80 and Gly 81 residues were 
deleted, in human-telomerase-immortalized retina pigment epithelial 
(hTERT-RPE1) cells. Within 1 or 2 days after transfection, both GFP- 
tagged CENP-A and RFP-tagged CENP-A(del) were recruited to the 
centromeres, which were identified by a constitutive centromere protein, 
CENP-C (Fig. 4a). This result indicates that the Arg 80 and Gly 81 


Figure 3 | Structural differences in the loop 1 regions between CENP-A and 
H3. a, Side view of the CENP-A nucleosome. The CENP-A molecules are 
shown in magenta and green. The box indicates the region enlarged in panel 
b. b, Superimposition of the CENP-A (magenta) and H3 (orange) loop 1 
regions. Arrows indicate the tip of the CENP-A loop 1 containing the Arg 80 
and Gly 81 residues. 
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residues are not essential for targeting CENP-A to the centromeres. 
However, 3 days after transfection, the number of cells in which the 
CENP-A(del) signals were detected at the centromeres was markedly 
reduced (Fig. 4b). Concomitantly, the number of cells with the CENP- 
A signal alone increased (Fig. 4b). Similar results were obtained when 
the fluorescent labels were swapped between CENP-A and CENP- 
A(del), showing that the phenomenon does not depend on the fusion 
partner (Fig. 4c). These results indicate that CENP-A(del) is less 
stably incorporated into centromeres, compared to CENP-A. In addi- 
tion, two CENP-A mutants, one containing the Arg80-Gly 81 to 
Ala 80-Ala 81 substitution (CENP-A(A80A81)) and another with the 
Val 82-Asp 83 deletion, which disrupts the Arg 80-Gly 81 protrusion 
(CENP-A(del82-83)), were targeted to centromeres at levels com- 
parable to those of CENP-A, 1 day after transfection. The number of 
cells retaining the CENP-A mutants at the centromeres also decreased, 
3 days after transfection (Supplementary Fig. 13a, b). However, like 
CENP-A, the CENP-A mutant containing the Val 82-Asp 83 to Ala82- 
Ala83 substitution (CENP-A(A82A83)) remained at the centromeres, 
3 days after transfection (Supplementary Fig. 13c). Thus, the Arg 80 
and Gly 81 residues and the size of the protruding loop 1 are critical for 
stable CENP-A retention at centromeres. 

There has been much debate over the CENP-A nucleosome struc- 
ture and its role in the centromere-specific chromatin structure. 
Because CENP-A has lower sequence homology to H3, compared to 
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Figure 4 | Less stable association of CENP-A(del) with the centromere. 

a, Fluorescence images. hTERT-RPE1 cells were transfected with GFP-tagged 
CENP-A and RFP-tagged CENP-A(del), fixed, and stained with anti-CENP-C 
(Cy5) and DAPI. Bar, 10 um. b, Quantitative data. Using images such as those 
in panel a, the numbers of transfected hTERT-RPE1 cells showing GFP- 
CENP-A, RFP-CENP-A(del), or both at centromeres were counted (n > 28), 
and the average percentages from three independent transfections were plotted 
with the standard deviations. c, hIERT-RPEI cells were transfected with GFP- 
tagged CENP-A(del) and RFP-tagged CENP-A, and were analysed as described 
in panel b (n> 20). 
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other H3 variants, the possibility of a CENP-A nucleosome composed 
of one of each core histone (hemisome) is an attractive proposition. 
However, our findings support the octasome model for the CENP-A 
nucleosome. It is still possible that both types of CENP-A nucleo- 
somes, octasome and hemisome, coexist in the functional centromeric 
chromatin in vivo. We also cannot exclude the possibility that CENP- 
A hemisomes can be reconstituted under different conditions and/or 
with factor(s) required for their assembly. Nevertheless, the present 
structure suggests that the fundamental principles involved in nucleo- 
some formation are likely to be similar among the H3 variants, includ- 
ing CENP-A. The flexibility exclusively observed in the DNA regions 
located at the entrance and the exit of the CENP-A nucleosome and the 
loop 1 region protruding from the CENP-A nucleosome may have an 
essential role in the centromeric chromatin architecture. 


METHODS SUMMARY 


Human CENP-A, H2A, H2B, H3.1 and H4 were overexpressed in Escherichia coli 
cells, and were purified by a method described previously”’”***. Details are pro- 
vided in Methods. The 147-base-pair DNA used in the CENP-A nucleosome 
reconstitution was prepared by self-ligation with the 71-base-pair fragment of a 
human «-satellite sequence’**', containing an extra 5-base overhang, 5’-GTAAC-3’, 
for the cohesive end. The resultant 147-base-pair DNA contained the CENP-B box 
near both edges, and an A:A mismatch was located at the centre of the DNA 
(Supplementary Fig. 2). The preparation, crystallization and structural determina- 
tion of the CENP-A nucleosome are described in Methods. Analyses of the fluor- 
escent protein-tagged CENP-A or CENP-A mutant incorporation at centromeres 
were performed using hTERT-RPE1 cells. Details are described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Overexpression of human histones. Human histones H2A and H2B were pro- 
duced in Escherichia coli BL21(DE3) cells, and histone H4 was produced in E. coli 
JM109(DE3) cells. Human CENP-A was produced in E. coli DH5« cells. All 
histones and CENP-A were produced in E. coli cells in the absence of T7 RNA 
polymerase by omitting the addition of isopropyl-B-p-thiogalactopyranoside, 
which induces the T7 RNA polymerase production in BL21(DE3) and 
JM109(DE3) cells. All histones and CENP-A were produced as N-terminal 
Hisg-tagged proteins, as described previously”’. The His, tags of all histones were 
removed by thrombin protease digestion, leaving a Gly-Ser-His sequence at the 
N-terminal end of each histone. 

For the purpose of structural determination, selenomethionine (Se-Met)- 
substituted H2B was produced in E. coli B834(DE3) cells, using the pET15b vector 
system (Novagen). The B834(DE3) cells were grown in 100 ml of LB medium for 
4h at 37 °C. The cells were collected and transferred into 300 ml of M9 medium 
(+50 pg ml! Se-Met). After 12h growth at 37 °C, the 300-ml culture was added 
to 21 of M9 medium (+50 pg ml | Se-Met), and the culture was continued at 
37 °C. When the cell density reached 0.5 (Dgoo), isopropyl-B-p-thiogalactopyranoside 
(final concentration 1 mM) was added, to induce the expression of H2B. The cells 
were grown further at 37 °C for 12h. 

Purification of human histones. The cells producing recombinant histones were 
collected, and were resuspended in 50 ml of buffer A (50 mM Tris-HCl (pH 8.0), 
500 mM NaCl, 1mM PMSF and 5% glycerol). The cells were disrupted by two 
rounds of sonication for 200 s each. The cell lysates were centrifuged at 27,216g for 
20 min at 4 °C. The supernatants were discarded, and the pellet containing the Hisg- 
tagged histones was resuspended in 50 ml of buffer A, containing 7M guanidine 
hydrochloride. The samples were rotated for 12 h at 4 °C, and the supernatants were 
recovered by centrifugation at 27,216g for 20 min at 4 °C. The supernatants contain- 
ing the Hisg-tagged histones were combined with 4ml (50% slurry) of nickel- 
nitrilotriacetic acid (Ni-NTA) agarose resin (Qiagen), and the samples were rotated 
for 1 hat 4 °C. The agarose beads were then washed with 100 ml of buffer B (50 mM 
Tris-HCl (pH 8.0), 500 mM NaCl, 6 M urea, 5 mM imidazole, and 5% glycerol). The 
His,-tagged histones were eluted by a 100-ml linear gradient of 5 to 500mM 
imidazole in buffer B, and the samples were dialysed against buffer C (5mM 
Tris-HCl (pH 7.5) and 2mM 2-mercaptoethanol). The N-terminal His, tags were 
removed from the histones by thrombin protease treatment (1 unit mg! of his- 
tones; GE Healthcare) at room temperature for 3 h. The removal of the Hisg tags was 
confirmed by SDS-16% polyacrylamide gel electrophoresis (PAGE); the recombin- 
ant histones without the His, tag migrated faster than the His,-tagged histones. 
After the His, tag was uncoupled, each histone was subjected to Mono S column 
chromatography (GE Healthcare). The column was washed with buffer D (20 mM 
sodium acetate (pH 5.2), 200 mM NaCl, 5 mM 2-mercaptoethanol, 1 mM EDTA, 
and 6 M urea), and each histone was eluted by a linear gradient of 200 to 800 mM 
NaCl in buffer D. The purified histones were dialysed against water, and were 
freeze-dried. 

Preparation of DNAs. The 147-base-pair DNA, which was used for reconstituting 
the CENP-A nucleosome, is a derivative of the human o-satellite DNA (sat4)*?. 
The EcoRI site (GAATTC) of the sat4 sequence was replaced by a BstPI site 
(GGTAACC). The 71-mer DNA fragment containing the 5’ half of the sat4 
sequence, with the CENP-B box at the edge, was ligated in tandem in the plasmid 
(p5’Sat4-24). The 71-mer DNA fragment containing an extra 5-base overhang, 
5'-GTAAC-3’, was prepared for self-ligation according to the method described 
previously*’. The 71-mer DNA fragment containing the 5-base overhang was self- 
ligated, and the palindromic 147-base-pair o-satellite DNA derivative was pre- 
pared. The 147-base-pair DNA sequence is: 5’-ATCCTTCGTTGGAAACG 
GGATTTCTTCATTTCATGCTAGACAGAAGAATTCTCAGTAACTTCTTTG 
TGCTGGTAACCAGCACAAAGAAGTTACTGAGAATTCTTCTGTCTAGCAT 
GAAATGAAGAAATCCCGTTTCCAACGAAGGAT-3’. 

In this palindromic 147-base-pair o.-satellite DNA derivative, an A:A mismatch 
was introduced at the centre of the 147-base-pair DNA fragment (underlined). 
Preparation of the CENP-A nucleosome. The purified H2A-H2B (Se-Met)- 
CENP-A-H4 (0.9 mg) and the 147-base-pair DNA (1 mg) were mixed in a solution 
containing 2M KCl, and the sample was dialysed against dialysis buffer (10 mM 
Tris-HCl (pH 7.5), 1 mM EDTA, 1 mM dithiothreitol and 2 M KCl). After dialysis 
at 4°C for 3h, the KCl concentration of the dialysis buffer was gradually decreased 
to 250 mM with a peristaltic pump (0.8 ml min_' flow rate). The sample was then 
dialysed against 10 mM Tris-HCl buffer (pH 7.5), containing 1 mM EDTA, 1mM 
dithiothreitol and 250 mM KCl, at 4 °C for 3h. After this dialysis step, the sample 
was incubated at 55 °C for 2 h. The CENP-A nucleosome was purified from the free 
DNA and histones by non-denaturing polyacrylamide gel electrophoresis, using a 
Prep Cell apparatus (Bio-Rad). The purified CENP-A nucleosome was concen- 
trated, and was dialysed against 20 mM potassium cacodylate buffer (pH 6.0) con- 
taining 1 mM EDTA. 


Crystallization and structure determination. Crystals of the purified CENP-A 
nucleosome were obtained by the hanging drop method, after mixing equal 
volumes of the CENP-A nucleosome solution and 20 mM potassium cacodylate 
buffer (pH 6.0), containing 60-96 mM KCl and 135-144 mM MnCl,. The CENP- 
A nucleosome sample was equilibrated against a reservoir solution of 20mM 
potassium cacodylate (pH 6.0), 38-56 mM KCl, and 70-75 mM MnCh. Crystals 
of the CENP-A nucleosome were soaked in a cryoprotectant solution, containing 
20mM potassium cacodylate (pH 6.0), 47mM KCl, 72mM MnCl, 30% poly- 
ethylene glycol 400, and 5% trehalose. The crystals were flash-cooled in a stream 
of N32 gas (100 K). The CENP-A nucleosome crystals belonged to the monoclinic 
space group P2,, with unit cell constants of a = 65.8 A, b= 83.3 A, c=176.8Aand 
/ = 100.7°, and contained one nucleosome in the asymmetric unit. High-resolution 
diffraction data were obtained using the synchrotron radiation source at the beam- 
line BL41XU station of SPring-8, Harima, Japan. 

Diffraction data of the CENP-A nucleosome were integrated and scaled with the 

HKL2000 program”. The data were processed with the CCP4 program suite**. The 
structure was solved by the molecular replacement method, using the MOLREP 
program™ and the human nucleosome structure (PDB accession number 3AFA) 
as a search model’*. Most of the amino acid side chains were clearly visible in the 
map initially calculated at 3.6 A resolution. Rigid body refinement of the obtained 
solution was performed using the CNS program”’. Further structural refinement 
consisted of iterative rounds of energy minimization and B factor refinement using 
the CNS program”, and model building using the COOT program”. The 
Ramachandran plot of the final structure showed 98.7% of the residues in the 
most favourable and additional allowed regions, and no residues in the disallowed 
region. Summaries of the data collection and refinement statistics are provided in 
Supplementary Table 1. All structure figures were created using the PyYMOL 
program (http://pymol.org). The atomic coordinates of the CENP-A nucleosome 
have been deposited in the Protein Data Bank, with the ID code 3AN2. 
Supercoiling assay. Salt-dialysis supercoiling assay. Relaxed plasmid DNA 
(500 ng) was mixed with 0, 125, 250, 500 and 1,000 ng of histone octamer in 
5 pl of 20mM Tris-HCl (pH 7.5) buffer, containing 1mM EDTA, 0.2 mg ml! 
BSA, and 2M NaCl. The samples were then incubated at 37 °C for 30 min. The 
NaCl concentration of the sample was reduced to 1 M, 0.8 M, 0.67 M, and 0.2 M by 
adding dilution buffer, containing 20 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.2 mg 
ml’ BSA, 5mM MgCh, and 0.06Uul * calf (Invitrogen) or wheat germ 
(Promega) topoisomerase I. The samples were incubated at 37 °C for 30 min in 
each dilution step. 
Chaperone-mediated supercoiling assay. NAP1 or sNASP (0.25, 0.5, and 1.0 uM) 
was pre-incubated with H2A—-H2B (150 ng) and CENP-A-H4 (150 ng) at 37 °C 
for 15 min. Supercoiled plasmid DNA (100 ng), which was relaxed with a topoi- 
somerase I solution (10 mM Tris-HCl (pH 8.0), 2 mM MgCl, 5 mM dithiothreitol, 
and2U wl? wheat germ topoisomerase I (Promega)), was added to the reaction 
mixture. The samples were then incubated at 37 °C for 60 min in 10 mM Tris-HCl 
(pH 8.0) buffer, containing 140 mM NaCl, 2mM MgCh, and 5 mM dithiothreitol, 
followed by an incubation at 42 °C for 60 min. 

In both the salt-dialysis and chaperone-mediated assays, after the reaction, the 
samples were treated with 50 ul of a proteinase K solution (20mM Tris-HCl 
(pH8.0), 20mM EDTA, 0.5% SDS, and 0.5 mgml ' proteinase K (Roche)) at 
37°C for 30min. The DNA was extracted with phenol/chloroform. The DNA 
was then precipitated by ethanol, and was analysed by one-dimensional gel elec- 
trophoresis on a 1% agarose gel in 1 X TAE buffer (for the salt-dialysis assay, 
1.3Vcm ! for 15.5h) or 1X TBE buffer (for the chaperone-mediated assay, 
1.3Vcm!? for 15.5 h). For the two-dimensional gel electrophoresis, the DNA 
was electrophoresed on a 0.7% agarose gel in 1X TBE buffer (for the salt-dialysis 
assay, 2Vcm ' for 7h) or a 1% agarose gel in 1X TBE buffer (for the chaperone- 
mediated assay, 1.3 Vcm ! for 15h) for the first dimension. The gel was then 
soaked in 1X TBE buffer containing 4 mg! ' of chloroquine for 3h. The samples 
were subsequently electrophoresed in 1X TBE buffer containing 4mgl~' of 
chloroquine (1.3 Vem! for 12h (for the salt-dialysis assay) or 1.3 Vcm | for 
15h (for the chaperone-mediated assay)) for the second dimension. The DNA was 
visualized by SYBR Gold (Invitrogen) staining. 

Nucleosome reconstitution by the salt-dialysis method for biochemical ana- 
lyses. The purified H2A~H2B-CENP-A-H4 or H2A—H2B-H3-H4 octamer was 
mixed with a DNA fragment (300 ug, 121-base-pair DNA or 147-base-pair DNA) 
in a solution containing 2 M KC] (376 pl). The amounts of histone octamers were 
420g for the 121-base-pair DNA and 384,1g for the 147-base-pair DNA. 
Nucleosomes were reconstituted and prepared by the same method as described 
in the ‘Preparation of the CENP-A nucleosome’ section. 

Competitive nucleosome assembly assay. The purified H2A-H2B-CENP-A- 
H4 octamer (14, 28, 42 or 56 lg) was incubated in the presence of both the 147- 
base-pair DNA (24 1g) and 121-base-pair DNA (20 11g), in a solution containing 
2M KCL, and the sample was dialysed against dialysis buffer (10 mM Tris-HCl 
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(pH7.5), 1mM EDTA, 1 mM dithiothreitol, and 2M KCl). After dialysis at 4°C 
for 3h, the KCl concentration of the dialysis buffer was gradually decreased to 
250 mM with a peristaltic pump (0.8 ml min | flow rate). The sample was then 
dialysed against 10 mM Tris-HCl buffer (pH 7.5), containing 1 mM EDTA, 1 mM 
dithiothreitol, and 250 mM KCl, at 4°C for 3h. The CENP-A nucleosomes were 
then analysed by 6% PAGE in 0.2X TBE buffer (18 mM Tris base, 18 mM boric 
acid, and 0.4mM EDTA) at 16Vcm ! for 1h, followed by ethidium bromide 
staining. 

Nucleosome disruption assay. The CENP-A nucleosomes were reconstituted 
with a 121-base-pair or 147-base-pair palindromic «-satellite derivative, by the 
salt dialysis method. The 121-base-pair DNA lacks the 13-base-pair regions from 
both edges of the 147-base-pair DNA used in the crystallography of the CENP-A 
nucleosome. The rest of the 121-base-pair DNA sequence is identical to the 147- 
base-pair palindromic «satellite derivative. The CENP-A nucleosomes (150 ng) 
were incubated at 37 °C, 57 °C, 67 °C, 70 °C, or 73 °C for 15 min in the presence of 
supercoiled plasmid DNA (100 ng). After the incubation, the CENP-A nucleo- 
somes that were not disrupted were separated by non-denaturing 6% PAGE, and 
were visualized by ethidium bromide staining. The relative band intensities for the 
CENP-A nucleosomes were quantified and plotted against the temperature. 
Exonuclease assay. The reconstituted CENP-A or H3 nucleosomes were treated 
with 3 units of Escherichia coli exonuclease III (Takara), in 10 pl of 50 mM Tris- 
HCl (pH 8.0), 5mM MgCl, and 1mM DTT. After an incubation for 0, 2, 4, or 
8 min at 37 °C, the reaction was stopped by the addition of 55 jl of proteinase K 
solution (20 mM Tris-HCl (pH 8.0), 20 mM EDTA, 0.5% SDS, and 0.5 mg ml 
proteinase K (Roche)). After a 15 min incubation at room temperature, the DNA 
was extracted with phenol/chloroform, precipitated with ethanol, dissolved in Hi- 
Di Formamide (Applied Biosystems), and then analysed by 10% denaturing PAGE 
with a gel containing 7 M urea in 0.5X TBE buffer (21 Vm! for 1.5h). 
Small-angle X-ray scattering (SAXS). SAXS measurements of the reconstituted 
CENP-A and H3 nucleosomes, in 20mM Tris-HCl buffer (pH 7.5) containing 
1mM EDTA and 1mM DTT, were performed at the RIKEN structural biology 
beam-line I (BL45XU) of SPring-8 (Hyogo, Japan)’’. Scattering intensities of the 
nucleosome solutions were measured with an R-AXIS IV‘* imaging plate 
detector at 20°C with a sample-to-detector distance of 3,529mm, which was 
calibrated by the powder diffraction from silver docosanoate. Circular averaging 
of the scattering intensities was then performed to obtain the one-dimensional 
scattering data I(q) as a function of q (q = 4msin0//), where 20 is the scattering 
angle and the X-ray wavelength 1 = 0.9 A). Three successive measurements were 
made for each solution, with an exposure time of 60 s. The resultant three data sets 
were combined after inspections for X-ray radiation damage to the solution and 
the existence of instrumental artefacts. SAXS measurements of the buffer solution 
for background subtraction were performed after each measurement of the 
nucleosome solutions, using the same conditions and procedure as those of the 
nucleosome solutions. To correct the inter-particle interference effect, I(q) data 
were collected at four protein concentrations (0.5, 0.7, 1.0 and 1.3 mg ml), and 
extrapolated to zero concentration. The data were processed and analysed using 
the software applications embedded in the ATSAS package (http://www. 
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embl-hamburg.de/biosaxs/software.html). The radius of gyration, Rg, was esti- 
mated by fitting the I(q) data using the Guinier approximation I(q) = I(0) 
exp(—q°R,’/3), where 1(0) is the forward scattering at the zero scattering angle, 
in a smaller angle region of qR, < 1.3. Error of Rg was estimated from the least- 
squares fitting. The distance distribution function P(r) and its error were calcu- 
lated by the program GNOM®. The maximum dimension D,,,, was estimated 
from the P(r) function as the distance r, where P(r) = 0 (ref. 39), and its error was 
estimated from the errors of the P(r) values around P(r) = 0. 

Centromere localization of CENP-A and CENP-A mutants. hTERT-RPE1 cells 
were transfected with combinations of wild-type CENP-A and CENP-A(del), in 
which two amino acid residues (the Arg 80 and Gly 81 residues of the CENP-A 
loop 1) were deleted, and tagged with either GFP or RFP, using GeneJuice (Merck) 
according to the manufacturer’s instructions. hTERT-RPE1 cells were also trans- 
fected with combinations of wild-type CENP-A tagged with RFP and CENP- 
A(del82-83) (where the Val 82 and Asp 83 residues of the CENP-A loop 1 were 
deleted), or CENP-A(A80A81) (where the Arg80 and Gly81 residues were 
replaced by Ala80 and Ala81), or CENP-A(A82A83) (where the Val82 and 
Asp 83 residues were replaced by Ala 82 and Ala 83), tagged with GFP. The cells 
were fixed with 4% paraformaldehyde 1-3 days after transfection, permeabilized, 
and stained with guinea pig anti-CENP-C* and donkey Cy5-conjugated anti- 
guinea pig Ig (Jackson ImmunoResearch). DNA was counterstained with 12.5 ng 
ml ' DAPI. The fluorescence images were collected using an inverted microscope 
(Ti-E; Nikon) with a X100 PlanApo VC numerical aperture (NA) = 14 oil- 
immersion objective lens, or a X40 PlanApo NA = 0.95 dry lens, equipped with 
an EM-CCD camera (iXon+; Andor). The numbers of transfected cells exhibiting 
the GFP- or RFP-tagged protein, or both, at the centromeres were counted, and the 
average percentages from three independent transfections were plotted with the 
standard deviations. 
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A two-step chemical mechanism for 
ribosome-catalysed peptide bond formation 


David A. Hiller', Vipender Singh'+, Minghong Zhong'} & Scott A. Strobel! 


The chemical step of natural protein synthesis, peptide bond 
formation, is catalysed by the large subunit of the ribosome. 
Crystal structures have shown that the active site for peptide bond 
formation is composed entirely of RNA’. Recent work has focused 
on how an RNA active site is able to catalyse this fundamental 
biological reaction at a suitable rate for protein synthesis. On the 
basis of the absence of important ribosomal functional groups’ , 
lack of a dependence on pH’, and the dominant contribution of 
entropy to catalysis*, it has been suggested that the role of the 
ribosome is limited to bringing the substrates into close proximity. 
Alternatively, the importance of the 2’-hydroxyl of the peptidyl- 
transfer RNA® and a Bronsted coefficient near zero® have been 
taken as evidence that the ribosome coordinates a proton-transfer 
network. Here we report the transition state of peptide bond 
formation, based on analysis of the kinetic isotope effect at five 
positions within the reaction centre of a peptidyl-transfer RNA 
mimic. Our results indicate that in contrast to the uncatalysed 
reaction, formation of the tetrahedral intermediate and proton 
transfer from the nucleophilic nitrogen both occur in the rate- 
limiting step. Unlike in previous proposals, the reaction is not fully 
concerted; instead, breakdown of the tetrahedral intermediate 
occurs in a separate fast step. This suggests that in addition to 
substrate positioning, the ribosome is contributing to chemical 
catalysis by changing the rate-limiting transition state. 

Several reaction mechanisms have been proposed for peptide bond 
formation (Fig. 1). The peptidyl transferase reaction occurs through 
nucleophilic attack of the «-amino group of aminoacyl-tRNA on the 
carbonyl carbon of the peptidyl-tRNA. A peptide bond forms and the 
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ester bond linking the peptide to the 3'-oxygen of peptidyl-tRNA breaks, 
leaving a deacylated tRNA and a peptide lengthened by one amino acid. 
If substrate positioning is the sole contribution to catalysis, the mech- 
anism within the ribosome is expected to be equivalent to the well- 
studied uncatalysed reactions, where a pathway involving two tetrahedral 
intermediates is followed (T+ and T—, Fig. 1; black pathway, see figure 
legend for explanation of mechanisms and intermediates). At high pH, 
the rate-limiting transition state for the uncatalysed reaction was pre- 
dicted’ to occur during deprotonation of the zwitterionic T+ intermedi- 
ate, on the basis of the pH-rate dependence of the reaction and a Bronsted 
coefficient near 1 (transition state D in Fig. 1). At low pH, breakdown of 
the negatively charged T— is rate-limiting (transition state F in Fig. 1). 
It has also been suggested*” that the peptidyl-tRNA 2’ -hydroxyl acts 
as a ‘proton shuttle’ that abstracts a proton from the nucleophilic 
amine and simultaneously donates another proton either directly or 
indirectly to the leaving group oxygen. This could happen ina stepwise 
fashion (Fig. 1, red and orange pathways), involving a tetrahedral 
intermediate. Alternatively, the reaction has been proposed to be fully 
concerted, with the amide bond to the nucleophile formed at the same 
time that the ester bond to the leaving group is broken (Fig. 1, green 
pathway). Recently, computational methods were used to evaluate 
concerted proton shuttle mechanisms with and without an additional 
water molecule, and favoured a fully concerted eight-member proton 
shuttle (similar to transition state A)'°. Alternatively, other computa- 
tional studies have indicated a stepwise proton shuttle would be 
favourable’. All of these possibilities have been proposed**”’. 
Each mechanism predicts a different transition state and hence a 
different role for the ribosome in catalysis. Therefore we sought to 
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Figure 1 | Proposed reaction mechanisms. Ground states are shown in black, 
intermediates in grey and transition states in blue. The ribosomal reaction may 
be fully stepwise with two intermediates, like the uncatalysed reaction (black 
pathway). Nucleophilic attack leads to an intermediate with a positively 
charged nitrogen and negatively charged carbonyl oxygen (T+). 


Deprotonation leads to a negatively charged intermediate (T—), which breaks 
down to products. Alternatively, one of these steps may be concerted, with only 
one intermediate (red and orange pathways) or the reaction may be fully 
concerted with no intermediates (green). The identities of B1 and B2 are 
uncertain; the 2’-hydroxyl may be one, both or neither. 
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obtain experimental constraints for modelling the transition state of 
ribosome-catalysed peptide bond formation using kinetic isotope 
effect (KIE) analysis (reviewed in ref. 14). Generally, if the rate effect 
of isotopic substitution (defined as kjignt/kneayy) is greater than one 
(termed ‘normal’, it indicates that bonding to that isotope is weaker 
in the transition state. Reaction coordinate motion of a substituted 
atom will also contribute to a normal isotope effect. Alternatively, if 
the isotope effect is less than one (termed ‘inverse’), it indicates stronger 
bonding. The magnitude of the effect is correlated with the change in 
bond order. To make this analysis possible, we previously synthesized a 
complete series of molecules that differ by a single isotopic substitution 
at each atom within the reaction centre’*'®. These molecules are sub- 
strates for the ribosomal 50S fragment reaction'”'*. Previous studies 
indicate that chemistry is rate-limiting for this assay and that unlike the 
70S reaction, which has a complete commitment to catalysis, this 
reaction is amenable to KIE analysis’*. The Bronsted coefficient of 
the nucleophile and the structure of the active site are within experi- 
mental error for 50S and 70S ribosomes, indicating that the mechanism 
of catalysis is similar*””. 

To achieve the necessary precision, KIEs were measured by a com- 
petitive assay. The light and heavy substrates were incubated in the 
same reaction, and the change in their ratio as the reaction proceeded 
was used to determine their relative reaction rates. We used two remote 
radiolabels, °’P and **P, attached to the 5’-ends of the P-site substrate 
and performed scintillation counting to define the relative abundance 
of the two substrates, as previously described for the uncatalysed reac- 
tion”, 

Given the small magnitude of the expected effects (typically in the 
range of 0.95 to 1.05) and the complexity of the experimental system (a 
1.5-MDa complex altered by 1-2 Da), we performed several controls to 
determine the extent of random and systematic error. First, we demon- 
strated that the effect from the remote label alone (1.001 + 0.002, 
mean = s.e.m.) is much smaller than the effects we expect for atoms 
at the reaction centre. This value and the effect of deuterium substi- 
tution at the amino acid a-carbon were determined 33 and 40 times, 
respectively. Repeated measurements in both cases yielded normally 
distributed data, with a significant difference between the mean of the 
control and the mean of the o-deuterium substitution (Fig. 2). 
Furthermore, two sets of measurements were made for each substi- 
tution, one pairing **P with the light isotope at the reaction centre and 
*°P with the heavy isotope, and a second with the opposite pairings— 
that is, °’P with the heavy isotope and **P with the light isotope. These 
determinations gave the same values within experimental error. 

We measured isotope effects for five positions at the reaction centre: 
the 3’-oxygen leaving group, the carbonyl carbon, the nucleophilic 
nitrogen, the hydrogen attached to the o-carbon, and the vicinal 
2'-hydroxyl (Fig. 3). The simplest effect to interpret is that of '*O 
substitution on the leaving group, which measures the extent of 
C-O bond dissociation in the transition state. Effects in the range of 
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Figure 2 | Histograms of measured isotope effects. Isotope effects on peptide 
bond formation for controls (solid bars, solid line) and «-deuterium 
substitution (hatched bars, dashed line) both fit to normal distributions. 
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Figure 3 | Kinetic isotope effects. Values are reported in upright font as 
mean + s.e.m., with the number of independent trials in parentheses. The 
second value for the nitrogen nucleophile (asterisked) was previously 
determined using mass spectrometry instead of scintillation counting as the 
readout. Calculated values for the transition state in Fig. 4 are shown in italics. 
Full substrates are shown: cyt, cytidine; cap, caproic acid; bio, biotin; pmn, 
puromycin. 


1.02 to 1.06 have been observed when cleavage of the C-O bond is rate- 
limiting’. Conversely, if formation of the amide bond or deprotona- 
tion of the T+ intermediate is the rate-limiting step, then a maximum 
effect of only 1.01 is expected’*. We observed a small effect (1.006) on 
3'-180 substitution, which indicates that there is not significant C-O 
bond-breaking in the rate-limiting step. This is inconsistent with transi- 
tion state A, the mechanism in which bond formation to the nucleophile 
is concerted with breaking the bond to the leaving group, and limits the 
possible transition states to B, C and D (Fig. 1). Notably, the leaving- 
group oxygen effect previously predicted for fully concerted proton- 
shuttle mechanisms varied from 1.023 to 1.038 (ref. 10; transition state 
A), which is inconsistent with the value of 1.006 measured here. 

The primary carbonyl-'*C isotope effect is derived from a combina- 
tion of bond breaking (that is, the bond to the leaving group oxygen), 
bond formation (to the nitrogen) and the accompanying change in 
hybridization. Given that the bond to the leaving group is intact, the 
last two factors should dominate. The observed effect for the carbonyl 
carbon is large and normal, 1.026. For hydrazinolysis of methyl formate, 
a similar effect has been interpreted as resulting from rate-limiting C-N 
bond formation™*. However, it was also shown computationally that 
an effect of approximately 1.02 could be derived from rate-limiting 
deprotonation of the zwitterionic intermediate (transition state D)*°. 
The remaining isotope effects made it possible to differentiate between 
these possibilities. 

The deuterium effect on the amino acid «-carbon is primarily 
derived from the loss of hyperconjugation between the carbonyl 
m-bond and the antibonding orbital of the C-H bond, which increases 
the strength of the C-H bond. The magnitude of this effect is largest 
when the ground state has a large geometric overlap between the 
m-bond and the antibonding orbital, and the transition state is very 
close to tetrahedral. Calculations and previous reactions indicate a 
lower limit of this isotope effect to be approximately 0.96 (refs 21, 
26). The measured isotope effect, 0.985, indicates a significant decrease 
in overlap in the transition state. This is consistent with the decreased 
m-bond character in a partially tetrahedral transition state. 

The KIE of the nucleophilic nitrogen was previously measured to be 
1.010, suggestive of an early transition state with partial C-N bond 
order'®. We remeasured this value using the remote radiolabel-based 
assay, and again obtained a normal isotope effect, 1.014. Formation of 
the C-N bond is expected to produce an inverse isotope effect on the 
nitrogen. The normal isotope effect observed can be derived from two 
factors: reaction coordinate motion and deprotonation of the nitrogen. 
Calculations here and elsewhere indicate that either of these factors 
alone is insufficient to result in a normal isotope effect (Supplementary 
Tables 1 and 2, and ref. 10). This suggests that the nitrogen is being 
deprotonated in the rate-limiting step, simultaneous with C-N bond 
formation (transition state C). Such a mechanism is consistent with the 


11 AUGUST 2011 | VOL 476 | NATURE | 237 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


near-zero Bronsted coefficient®, which indicates that there is no 
buildup of positive charge on the nucleophile in the transition state. 

The 2’-hydroxyl contributes 100- to 2,000-fold to the rate of 
ribosome-catalysed peptide bond formation’ and has been postulated 
to be involved in proton transfer*"*. The isotope effect for this position 
is close to unity (1.002). Changes in bond order between the 2'-oxygen 
and protons can be compensated by opposite changes in bond strength 
to the 2'-carbon; therefore this effect is relatively insensitive to the 
protonation state of the 2’-oxygen’’. The measured effect indicates 
that the oxygen does not bear a large positive or negative charge, 
consistent with previous studies’’. 

To supplement the qualitative descriptions above, we calculated 
isotope effects for several potential transition state structures (Sup- 
plementary Tables 1-3). Although we cannot exclude the possibility 
that multiple steps contribute to the isotope effects, a single transition 
state structure did yield calculated values in reasonable agreement with 
the experimental measurements (Fig. 4). As expected for a transition 
state, there is a single large imaginary frequency corresponding to 
reaction coordinate motion (carbon-nitrogen bond formation and 
nitrogen deprotonation). The error inherent in these calculations 
and our measurements produces an uncertainty in the precise struc- 
ture of the transition state; however, a single rate-limiting step is con- 
sistent with the measured isotope effects. Formation of the tetrahedral 
intermediate is simultaneous with deprotonation (transition state C). 
This step is followed by fast breakdown of the tetrahedral intermediate 
into products (red pathway in Fig. 1). The data are not consistent with 
a fully concerted reaction mechanism. 

The two-step mechanism identified here is markedly different 
from what has been observed for similar uncatalysed reactions””'. 
For uncatalysed reactions at low pH, stepwise formation of T— is 
rate-limiting. At high pH, or with the strong nucleophile hydroxyla- 
mine, breakdown of the T— intermediate is rate-limiting. Our results 
indicate that the ribosome not only alters the relative rates of formation 
and breakdown of tetrahedral intermediates, but also substantially 
alters the energy landscape such that nucleophilic attack and depro- 
tonation are coordinated. 

Concerted attack and deprotonation have also been proposed for 
aminolysis of the acyl-enzyme intermediate by chymotrypsin**”’. This 
conclusion was supported in part bya low Bronsted coefficient, similar 
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Figure 4 | Structures of the substrates and transition states for peptide bond 
formation shown in similar orientations. a, Electrostatic representations. 
Blue is electron-deficient (cationic) and red is electron-rich (anionic). 

b, Geometric representations. Carbons are shown grey, oxygens are red, 
nitrogens are blue, and hydrogens are white. Protons on the nucleophilic 
nitrogen, 2'-hydroxyl and water are displayed; for clarity, all others are omitted. 
Bond lengths are shown in A. The structure was obtained by matching isotope 
effects calculated with Gaussian03 and Isoeff to the experimental values. 
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to that observed for peptide bond formation. This indicates that the 
pK, of the intermediate in the context of these active sites is lower than 
that of a catalytic group, so that T+ does not have a finite lifetime and 
instead T— is the only stable intermediate’. Although the pK, of the 
amine is initially high, it decreases dramatically as the C-N bond is 
formed. For chymotrypsin, a catalytic histidine is responsible for 
deprotonating the nucleophile. For the ribosome, the lack of a pH 
dependence of the reaction argues against general-acid or general-base 
catalysis*. The 2’-hydroxyl is important for the ribosome reaction, but 
the pK, of a hydroxyl dictates that if it abstracts the amino proton it 
must donate its proton elsewhere. This mechanism, termed the ‘proton 
shuttle’, has been proposed in a variety of forms for the ribosome*”. 
Alternatively, the 2’-hydroxyl may provide an important transition 
state hydrogen bond, with another group or a water molecule 
deprotonating the nucleophile. Either possibility would be consistent 
with measured solvent isotope effects, which indicate more than one 
proton is in motion at the transition state (S. Kuhlenkoetter and 
M. Rodnina, personal communication). 

The destination of the proton originating from the nucleophile is 
uncertain. Two computational studies favour the carbonyl oxygen, 
which would otherwise develop a negative charge’?’*. This would 
result in an uncharged tetrahedral intermediate, which would be con- 
sistent with the lack of pH dependence of the reaction. Alternatively, 
a water molecule observed in crystal structures is correctly positioned 
to accept a proton from the 2’-hydroxyl and later donate it to the 
3'-oxygen leaving group. Finally, the 3’-oxygen could receive the 
proton from the 2'-hydroxyl. Our data cannot distinguish between 
these possibilities; for simplicity, we have modelled a water molecule 
as the proton acceptor. 

The ribosome increases the rate of peptide bond formation by an 
estimated 10’-fold through an altered chemical mechanism. The rate 
of nucleophilic attack is probably increased through positioning 
effects—first, the increased likelihood of the two substrates being in 
proximity, and second, precise orientation of the nucleophile (possibly 
by the 2'-hydroxyl of A2451; ref. 2). The ribosome also has a signifi- 
cant catalytic role, beyond substrate positioning, by coordinating 
nucleophilic attack and deprotonation in a single rate-limiting step. 
Catalysis by the ancient, conserved, RNA active site of the ribosome 
fundamentally alters the reaction pathway for peptide bond formation 
relative to the uncatalysed reaction. 


METHODS SUMMARY 


A general strategy for measuring isotope effects with these substrates using a 
competitive assay has been described”’. The P-site substrate cytidylyl-(3'5’)- 
cytidylyl-(3'5")-3'(2’)-O-(N-(6-D-(+)-biotinoylaminohexanoyl)-L-phenylalanyl) 
adenosine (CCApcb) was synthesized as described’*"*. For each measurement, 
two substrates differing by a single isotopic substitution at the reaction centre were 
mixed. Each substrate was labelled with either *”P or **P to allow differentiation by 
scintillation counting. 

Reaction mixtures contained 7 mM MgCl, 140 mM NH,Cl, 25 mM HEPES pH 
8.5, trace P-site substrate, 250-500 mM A-site substrate, and 6-10 uM 50S ribo- 
somes. At these concentrations, the isotope effect is on kcat/Km for the P-site 
substrate. Aliquots were quenched at 15-30% reacted and near the reaction end- 
point. Substrate and product were separated on an acrylamide gel and the fraction 
reacted determined by phosphorimaging. Each band was eluted and scintillation 
counted to determine the ratio of **P to **P in each sample. 

To account for substrate that hydrolysed before the reaction was started, the 
amount of *’P- and *?P-labelled product at zero time was subtracted from all deter- 
minations of the product isotope ratio. Additionally, some substrate was unreactive 
even at long time points. The amounts of *P- and **P-labelled unreactive substrate 
were similarly subtracted from all determinations of the substrate isotope ratio. 

Isotope effects were calculated as previously described’, using hybrid density 
functional methods implemented in Gaussian03. Transition state structures were 
optimized and frequencies computed on the optimized structures using the three- 
parameter Becke exchange functional, the LYP correlation functional and the 
standard 6-31G (d,p) basis set. Isotope effects were calculated from the computed 
frequencies using ISOEFF 98°. Methylamine was used to model the nucleophile. 
Calculated isotope effects for tetrahedral intermediate formation were repeated 
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with a set of nucleophiles of increasing size and complexity; these effects varied by 
less than 0.003. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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LETTER 


METHODS 


Reaction assay. A general strategy for measuring isotope effects with these sub- 
strates using a competitive assay has been described*'. The P-site substrate 
cytidylyl-(3'5’)-cytidylyl-(3'5')-3'(2')-O-(N-(6-b-(+)-biotinoylaminohexanoyl)- 
L-phenylalanyl)adenosine (CCApcb) was synthesized as previously described'*'®. 
Substrates were prepared that differed only by a single isotopic substitution at the 
reaction centre. Each substrate was 5'-end labelled with either y**P- or y**7P-ATP 
by polynucleotide kinase and purified on a pH 6 denaturing 15% polyacrylamide 
gel. °*P- and **P-CCApcb were mixed at an approximately 1:4 ratio and purified on 
a pH 6 non-denaturing 12% polyacrylamide gel. Purified mixes were used as soon 
as possible to minimize hydrolysis in the starting material. 

The isotopic enrichment of each sample was determined by high-resolution 
Fourier-transform-ion cyclotron resonance mass spectrometry. Each molecule 
had 95% or greater isotopic purity. 

Ribosome reaction mixtures contained 7 mM MgCh, 140 mM NH,Cl, 25 mM 
HEPES pH 8.5, trace P-site substrate, 250-500 mM A-site substrate, and 6-10 11M 
50S ribosomes. At these concentrations the A-site substrate is saturating but the 
ribosome concentration is not; therefore the isotope effect is on k-a/Kyy for the 
P-site substrate. All reactions were performed at room temperature. Reaction 
aliquots at 15-30% reacted (approximately 1 min) and near the reaction endpoint 
(greater than 30 min) were quenched with three volumes of formamide loading 
buffer. Each time point was run on a pH 6 denaturing 15% polyacrylamide gel to 
separate substrate from product. 

Substrate and product bands were visualized using a Storm 840 PhosphorImager 

with a two-ply sheet of Duck tape between the gel and screen to block **P emission. 
The fraction of **P-labelled substrate reacted could then be determined. The ratio of 
*°P to **P was determined by scintillation counting. Each product band was excised 
from the gel and eluted into 1 ml of 50 mM NaCl overnight. This elution was then 
added to 13 ml of Optima Gold scintillation fluid and counted for 30 min along with 
*°P and **P standards. 
Data analysis. Counts per minute were divided into two channels, 0-400 keV and 
400-2,000 keV. Approximately 70% of the **P standard was detected in the high 
energy channel, and greater than 99% of the **P sample was detected in the low 
energy channel. The ratio of *’P to **P in each sample could be determined using 
equation (1): 


33p/°2p = (A — Br)/[B(1 + n)] (1) 


where A is the counts per minute in the low energy channel, B is the counts per 
minute in the high energy channel, and ris the ratio of emission of a **P standard 
detected in the low energy channel to the high energy channel. To account for 
substrate that hydrolysed before the reaction was started, the amount of **P- and 
*°P-labelled product at zero time was determined. These were subtracted from all 
other determinations of the product isotope ratio. Additionally, some substrate 
was unreactive even at long time points. The amounts of **P- and **P-labelled 
unreactive substrate were similarly subtracted from all other determinations of the 
substrate isotope ratio. The observed isotope effect was then determined from the 
ratio in the midpoint and endpoint samples and the fraction reacted using equa- 
tion (2): 


KIE = log(1 — f)/log(1 — fR,/Ro) (2) 


where fis the fraction reacted, R,, is the isotope ratio in the product at that fraction 
reacted, and Rg is the isotope ratio in the product at the reaction endpoint. For 


high-fraction-reacted samples (greater than 50%), the isotope effect was also 
determined from substrate and endpoint samples using equation (3): 


KIE = log(1 — f)/log[(1 — f)R./Ro] (3) 


where R, is the isotope ratio in the remaining substrate at f fraction reacted, and Ro 
is as above. This value was corrected for incomplete isotopic incorporation as 
determined by mass spectrometry with equation (4): 


KE corrected = 1 + (KE observed — 1)/{1 — KE observea(1 — @)] (4) 


where ¢ is the isotopic enrichment of the heavy sample. This equation assumes a 
negligible amount of heavy isotope in the light sample'**'. 

For the control and «-deuterium isotope effects, enough trials were performed 
to plot a histogram of the data (Fig. 2). Both data sets were fitted to a normal 
distribution, equation (5): 


c= aexp{—0.5[(x — w)/o}} (5) 


where c is the number of counts in a given bin, a is the amplitude, vis the mean and 
o is the standard deviation. For some substitutions, the standard errors for trials 
performed on the same day were smaller than those from different days, indicating 
that each trial may not be independent. Therefore, for all substitutions each set of 
experiments from a single day were averaged. The mean and standard error of 
multiple data sets are reported. 

Computation of transition states. Transition state structures that reproduced the 
experimental KIEs were determined using hybrid density functional methods imple- 
mented in Gaussian03. Tetrahydro-4,5-dihydroxy-2-(methoxymethyl)furan-3-yl 
2-acetamidopropanoate was used to model the P-site substrate and methylamine 
was used to model the A-site nucleophile. Calculated isotope effects for tetrahedral 
intermediate formation were repeated with a set of nucleophiles of increasing size 
and complexity; the calculated isotope effects varied by less than 0.003. 

Structures of the transition states were optimized and the frequencies were com- 
puted for the optimized structures using the three-parameter Becke (B3) exchange 
functional, the LYP correlation functional and the standard 6-31G (d,p) basis set. The 
5'-methoxy group and the reaction centre were constrained during the optimization 
and many of these constraints were modified to match the experimental KIEs. 

KIEs and equilibrium isotope effects (EIEs) were calculated from the computed 
frequencies using ISOEFF 98°°. KIEs and EIEs were calculated for a temperature of 
298 K and the frequencies were scaled using a factor of 0.964, corresponding to 
B3LYP/6-31G(d, p). KIEs were calculated whenever the magnitude of the imaginary 
frequency was greater than 50icm” ', otherwise EIEs were computed. All vibrational 
modes were used to calculate isotope effects. 

Geometric and the electrostatic models were generated by iteratively optimizing 
the transition states by modifying the applied constraints until the computed 
isotope effects closely matched the experimental KIEs. 

The natural bond orbital (NBO) calculations were performed on optimized 
structures by including the pop = nbo keyword in the route section of input files. 
The molecular electrostatic potential (MEP) surfaces were calculated by the CUBE 
subprogram of Gaussian03. The formatted checkpoint files used in the CUBE 
subprogram were generated by constrained geometry optimization at the 
B3LYP level of theory with the 6-31G** basis set. MEP surfaces of the substrate 
and the transition states were visualized using Molekel4.0 at a density of 0.2 
electrons per A*. Geometric figures were created using PyMol. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature10281 


Migrastatin analogues target fascin 


to block tumour metastasis 


Lin Chen, Shengyu Yang, Jean Jakoncic, J. Jillian Zhang 
& Xin-Yun Huang 


Nature 464, 1062-1066 (2010) 


In this Letter, we reported the crystal structure of macroketone bound 
to fascin (Protein Data Bank number PDB 3LNA) (Fig. 2d). The 
chemical structure of macroketone was incorrectly shown and has 
been corrected (PDB 308K). We have been advised that the crystal- 
lographic data for this complex are not technically robust, and do not 
justify the conclusion that macroketone is bound as shown in Fig. 2d. 
We therefore regretfully withdraw the X-ray structure models PDB 
3LNA and PDB 308K from this study. However, we believe that the 
rest of the Letter, including the observations made using the mutants, 
is not directly affected. 
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Astronomy in the outback 


A bid to host amajor international radio telescope has created a demand for astronomers in 
Western Australia, and built a community that looks set to endure. 


BY JAMES MITCHELL CROW 


ery little happens in the outback of 
\ / Western Australia — and that’s how 
a growing group of astronomers likes 
it. The area is remote from urban centres and 
industry, so it is extremely radio-quiet: free of 
the ‘noise’ generated by cars, mobile phones and 
other trappings of civilization. That makes it an 
ideal spot in which to study radio waves from 
distant corners of the Universe, and is key to the 
nation’s bid to host a prestigious international 
telescope, the Square Kilometre Array (SKA). 
Five years ago, there was not a single astron- 
omy researcher based in the state. Now, with the 
SKA on the horizon, two advanced pilot radio 
telescopes planned, high-level government 


funding available and a fledgling astronomy 
community looking to add to its ranks, Western 
Australia has the potential to be a research hub 
for years to come. But to succeed, it will have to 
sustain funding and attract enough researchers 
to this remote corner of the world. 


SKA BAND 

The astronomy expansion was sparked in 2006, 
when an international committee announced 
that Australia was one of two nations bidding 
to host the SKA, picked from a shortlist of four. 
The €1.5-billion (US$2.1-billion) SKA will be 
the world’s most powerful radio telescope, a 
continent-spanning array of dishes with a total 
collecting area of one square kilometre that 
will combine to make an instrument 50 times 


more sensitive than today’s best. It is expected 
to help unlock some of the Universe’s biggest 
mysteries, including the nature of dark energy 
and dark matter. Next year, the SKA selection 
panel will make its final decision on whether 
the telescope will be built on sites spanning 
Australia and New Zealand, or in South Africa. 
Not every astronomer is convinced of the 
SKA‘s merits. Some wonder whether scientists 
and engineers can overcome technological, 
computing and energy-consumption chal- 
lenges to maximize the array’s effectiveness. 
“Any project of the scale that we are talking 
about has people who question whether this is 
the right way to spend the money,” says Anton 
Zensus, director of the Max Planck Institute 
for Radio Astronomy in Bonn, Germany. P 
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Hundreds of radio antennas could one day populate the outback, as in this artist’s rendition. 


> The SKA has ample support in Europe, but 
sentiment is mixed among US astronomers; 
some suggest that a large optical telescope 
should be the priority. The array’s proponents 
are pushing ahead anyway, continuing with 
the site-selection process and developing 
technologies to enable it. Western Australia’s 
astronomy community has already benefited. 


INFRASTRUCTURE BUILD-UP 
The University of Western Australia (UWA) 
and Curtin University, both in Perth, are work- 
ing hard to bring astronomers to the state. 
In 2006, the state government awarded two 
Aus$1-million (US$1.1-million) Premier’s 
fellowships to allow the UWA to hire radio 
astronomers. At the same time, Curtin estab- 
lished a high-profile position in radio astron- 
omy, and another in astronomy engineering. 
The academics who filled these posts came 
up with a plan to expand the local astronomy 
community. On 1 September 2009, they opened 
the International Centre for Radio Astronomy 
Research (ICRAR) in Perth — a joint venture 
between the UWA and Curtin, established with 
Aus$100 million in funding from the universi- 
ties and the state government to take it through 
to 2014. Planning for a grant-renewal applica- 
tion to form an expanded ICRAR 2 is already 
under way, says Peter Quinn, one of the Pre- 
mier’s fellows and the director of ICRAR. The 
university recruited him from the European 
Southern Observatory in Munich, Germany. 
The centre already employs more than 60 
staff members, 55 of whom are researchers in 
astronomy, engineering or information tech- 
nology. More than half have been recruited 
from overseas. It also has 22 PhD students and 
20 postdocs. “We aim to be a really strong part 
of the Australian bid for the SKA’ says Quinn, 
“but we want also to be a major contributor to 
the international SKA project, independent of 
where the SKA is placed”? The UWA and Cur- 
tin have already established seven permanent 
tenure-track positions at ICRAR. 


Alan Duffy is a postdoctoral researcher 
who joined ICRAR in 2009. He came from 
one of the world’s most prestigious astronomy 
research institutes, the Jodrell Bank Centre for 
Astrophysics in Manchester, UK. The culture 
difference is marked, says Duffy. “It’s a lot more 
vibrant,” he adds. “In ICRAR there’sa real sense 
ofa bit more adventure going on, as it is sonew” 

Two key attractions brought Duffy to 
ICRAR. “In the bigger institutes in Europe, 
often you're more of a cog in a machine, 
whereas at ICRAR I had the chance to really 
define my own research agenda,’ he says. Duffy 
runs computer simulations to test theories 
related to galaxy formation and dark matter. 

The other attrac- 
tion was the infra- 
structure being built 
nearby, including 
two ‘pathfinder’ radio 
telescopes that will 
test some of the tech- 
nology to be used 
on the SKA. One is 
the Aus$150-million 
Australian SKA Path- 
finder (ASKAP), 


which will, forexam- “There’s areal 
ple, try out ways of sense of abit 
viewing huge areas more adventure 
of the sky atonceby going on, asitis 
combining the signals sonew.” 

from multiple anten- Alan Duffy 


nas. The other is the 
Murchison Widefield Array, which will consist 
of 512 antenna elements and will test some of 
the signal-processing systems to be deployed 
on the SKA. Both instruments will be located at 
the Murchison Radio-astronomy Observatory, 
opened in 2007 about 750 kilometres north of 
Perth. The observatory will be the base of the 
SKA if Western Australia's bid is successful. 
The raw data from the telescopes will be 
stored and processed at the Pawsey Centre, an 
Aus$80-million supercomputing facility in a 
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suburb of Perth, which is expected to be among 
the top 20 fastest supercomputers in the world 
when it opens in 2013. Astronomy brought the 
Pawsey Centre to Western Australia, but other 
scientists will benefit from its presence, includ- 
ing geoscientists and biologists, says Andrew 
Rohl, executive director of iVEC, the joint ven- 
ture that will build and run Pawsey. 

The investment so far adds up to between 
Aus$450 million and $470 million. This is spe- 
cially allocated money, notes Quinn, and it has 
not been taken from existing state or federal 
government astronomy funds or projects. 


IS THE SKA THE LIMIT? 

This year, six Australian universities, including 
the UWA and Curtin, joined together in the 
Australian Research Council Centre of Excel- 
lence for All-Sky Astrophysics (CAASTRO), a 
consortium aimed at training astronomers. It 
has Aus$28 million of federal funding, which 
goes mainly towards employing postdocs, says 
Lister Staveley-Smith, deputy director of the 
consortium. The first recruits have already 
started work, and the centre will support close 
to 50 postdocs over the next seven years, with 
funding renewal possible. 

If Australia wins the right to host the SKA, 
there will probably be another hiring surge. “If 
the SKA did come to Australia, then I could see 
ICRAR growing to a major science centre in 
the 100-plus staff range,” says Quinn. Zensus 
predicts that there will be several hundred 
researchers affiliated with the SKA at any one 
time, first commissioning the site and then 
operating the central array, conducting experi- 
ments and running the computer centre. 

Zensus notes that Western Australia’s work- 
force and infrastructure are unlikely to “fizzle 
away’, whether the bid is successful or not. The 
astronomy infrastructure elsewhere in Aus- 
tralia — such as SkyMapper, an optical survey 
telescope near Canberra — should complement 
the radio data from ASKAP. “From an academic 
point of view, there’s a critical mass now — 
you don't notice any isolation because they've 
brought some of the best people in the world 
to you,’ says Duffy. Staveley-Smith agrees. “You 
wouldnt think that Perth would be the number- 
one spot for someone from Europe to apply for 
a job, but we've had some excellent applicants,” 
he says. “I think the astronomy community 
worldwide recognizes that the government here 
is being very supportive to astronomy at the 
moment, coincidentally at a time when things 
are not quite so good in other places.” 

“If anything, I’m even more keen to stay in 
the country than when I first applied,” says 
Duffy, who is a year away from the end of 
his postdoctoral contract and is applying for 
an Australian Research Council early-career 
award. “There's a sense that Australia really is 
this amazing place to be right now’ = 


James Mitchell Crow is a science journalist 
based in Melbourne, Australia. 
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TURNING POINT 


Joel Rosenthal 


Joel Rosenthal, a chemist at the University of 
Delaware in Newark, was one of 30 young 
researchers in the United States to win a 2011 
Ralph E. Powe Junior Faculty Enhancement 
Award, which provides research seed money. 
The award came in April from the Oak Ridge 
Associated Universities consortium, based in 
Tennessee. 


What was the first turning point in your 
career? 

I always thought I'd be a biologist, but as 
an undergraduate at New York University 
(NYU), I discovered a logic behind chem- 
istry — its mechanisms and its quantitative 
nature — that I found compelling. Using the 
mechanisms underlying chemical reactions 
to build compounds and materials was like 
solving a puzzle. So I started studying organic 
chemistry. Then I heard Daniel Nocera, an 
inorganic chemist at the Massachusetts Insti- 
tute of Technology (MIT) in Cambridge, give 
a seminar on molecular-level mechanisms of 
energy conversion. I realized that inorganic 
chemistry offered many more avenues for 
researching energy storage and release than 
did organic chemistry. During Dan's visit to 
NYU, he encouraged me to apply to MIT for 
my PhD. I went on to study inorganic chem- 
istry in his lab. 


Nocera’s work drew media attention. What 
did that teach you? 

Dan has achieved a series of high-profile 
advances — most recently, an inexpensive 
solar cell dubbed the artificial leaf. I was in 
his lab when he first started to attract inter- 
est from the mainstream press, including the 
television news programme Nightline and the 
science show NOVA. I began to understand 
how important it is to communicate chem- 
istry and energy research to the public. Dan's 
ability to break important problems into 
small yet meaningful advances was a big part 
of his success. 


What influenced your decision to accept a 
position at Delaware? 

This university and department are under- 
rated. Delaware is putting lots of resources 
into equipment such as spectrometers and an 
advanced X-ray diffractometer. In the cur- 
rent economic climate, this is one of the few 
institutions that is significantly increasing 
its research programme and hiring faculty 
members. There is a great atmosphere in 
my department, with six young professors 
all building their research programmes and 


collaborating with one another and with 
senior colleagues. 


You assembled a seven-member lab group 
in your first 18 months at Delaware. Do you 
worry about over-committing your resources 
through such fast growth? 
It can be difficult to compete with established 
researchers when recruiting students, but I 
convinced a postdoc and six first-year gradu- 
ate students to join me. I’m not worried; in 
fact, 'm looking to grow further. Society will 
suffer if scientists don't tackle capture and 
storage of energy. In my lab, we're trying to 
generate synthetic fuels by using nickel and 
palladium electrocatalysts to convert carbon 
dioxide into, for example, energy-rich carbon 
monoxide, which is a building block for fuels. 
I hope that in the next five to six years, 
my team and others will have mapped out 
important molecular-design principles that 
will help us to design catalysts. I need as many 
people helping me as possible. The question is 
whether I bring in enough dollars to support 
them all. But being here — where costs are 
much lower than at MIT, for example — has 
made it easier to quickly establish a group. 


What has been the biggest change in energy 
research in the past few years? 

There is a much greater emphasis on applied 
science, which is important, but I hope that 
it will not consume the field. There are also 
more people coming in. We need scientists, 
but I also see people doing their same old 
research and wrapping it in an energy con- 
text to tap into funding, without trying to 
be innovative. We need paradigm-shifting 
fundamental research. = 
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UNITED KINGDOM 
More visas for scientists 


As part ofa policy overhaul, Britain has 
launched a visa scheme for ‘exceptionally 
talented’ people, including scientists. UK 
immigration policies were revised last year 
to reduce the number of migrant workers 
allowed in (see Nature 467, 491; 2010); the 
latest scheme came into effect on 9 August. 
The science portion will be administered 
by the Royal Society in London, which 

can recommend 300 researchers per year 
whose potential to become leaders in 

their fields is evidenced by major prizes 

or fellowships. Applicants need not be 
sponsored by an employer. Visas are 

valid for 40 months, after which visiting 
scientists may apply for a two-year 
extension, followed by possible settlement. 


EUROPEAN UNION 
Industrial PhD launched 


The European Commission (EC) has 
created a pilot industrial-PhD programme 
to unite academia and businesses. The 
European Industrial Doctorates scheme 
will have €20 million (US$28 million) 

to fund about 100 researchers until 

2013, and aims to teach scientists how 

to combine scientific excellence with 
business skills. Participating PhD students 
at partner universities will spend at least 
half their time at a company in a different 
European Union (EU) state or associated 
country. Partner businesses will pay some 
expenses. EC policy coordinator Vanessa 
Debiais-Sainton says that the programme 
is likely to become a permanent part of 
the EU research-funding scheme Horizon 
2020, set to launch in 2014. 


INNOVATION 
Female inventors’ prize 


A pilot award scheme aims to encourage 
scientific innovation and invention by 
women in the European Union (EU). 

The EU Prize for Women Innovators 

will award €100,000 (US$142,000) for 
first place, €50,000 for second place and 
€25,000 for third. Applicants must be EU 
residents, have founded or co-founded a 
business before 2009 and have received 
EU funding. Entries will be judged on 
scientific merit and economic impact. 
Applications are open until 20 September. 
European Commission (EC) spokesman 
Mark English says that the EC hopes to 
inspire more women to commercialize 
their research. In future, the award criteria 
could change to emphasize research at an 
earlier stage of commercialization. 
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INTERVENTION 


BY JOHN GILBEY AND BRIAN MALOW 


r he marquee outside Building 40 
sizzled quietly in the late afternoon 
Californian sun, a touch of retro flair 

ona starkly modern building. In front of me 
in the queue for registration, a famous tele- 
vision cosmologist and a cheerful, bearded 
philosopher were animatedly comparing 
their recent flights and the number of time- 
zones they‘ crossed, while the smiling but 
slightly pressured young organizers dished 
out name badges, ethically sourced cotton 
goodie bags and T-shirts emblazoned with 
the conference logo. 

“Hi John! Great to see you again!” Brian, 
grinning wildly, pumped my hand in wel- 
come. “Bit warmer than London, isn’t it? 
Let’s find a drink” 

It was cooler inside the building, where 
the sun was filtered to a murky glow, and 
as we commandeered a nearby coffee pot 
I asked Brian if he really thought this was 
going to work. “Sure it will! You Brits worry 
way too much. Look, we’ve narrowed it 
down to about 20 people — everyone else is 
too well-known. Ifthere’s one here I reckon 
that these folk have to be our best shot.” 

I scanned the list. None of the names 
jumped out as potential conspirators — but 
I guess that is the whole point. “OK,” I sighed 
resignedly, “I'm game if you are. I suppose if 
it goes badly wrong we can always say it was 
a piece of performance art...” 

“Good call — you're only 40 miles from 
San Francisco, remember ... So, heck, any- 
thing goes!” 

I looked again at the list. “OK, but in that 
case I get to check out the girls...” 

Brian laughed out loud, spilling his cof- 
fee, and said something unrepeatable about 
my lifestyle — then we set out to circulate 
through the amiable crowd of slightly jet- 
lagged scientists, philosophers and other 
vaguely off-beat thinkers. 

An hour later, having munched my way 
along the wonderfully healthy buffet and 
then quashed any lingering benefits with 
a couple of artisan beers fresh from the 
ice bucket, I was no nearer finding whom 
I was looking for. Everyone was friendly, 
effusive and seemingly not at all worried by 
the distinctly odd questioning of a balding, 
bearded, overweight writer. 

Another beer, or two, convinced me 
that our mission was misdirected, that 
our premise was flawed and that — in any 
case — our methodology for delivery was 


Time out. 


doomed to ignominy. Most of all, the whole 
thing suddenly seemed a terrible waste of 
this beautiful, deep-blue-skied evening in 
Silicon Valley — surrounded as I was by 
some of the brightest minds of the age. 

It was then that I spotted Brian in 
animated discussion with a petite, dark- 
haired girl just next to the ice-cream freezer. 
Wandering past, I checked her name badge 
— and was outraged to realize she was on 
my list. 

Ihave to admit though, that he was very, 
very good: 

“I guess you're here for the same reason 
Iam, yeah?” She smiled at this and made a 
non-committal move of her head. “Thought 
so. I mean, the theme of the conference alone 
is going to attract people like us, am I right? 
I mean, “Time as a Flexible Continuum’ — 
there have to be a few of us here, huh...?” 

Brian’s engaging grin started to thaw her 
out, and she nodded slowly. “Terrific! I can 
see we are going to have to swap notes on 
this. I hadn't realized that they started talking 
so seriously about temporal stuff this early 
— I thought it wasn’t for another 50 years. 
I mean, the technical side is way too primi- 
tive to bother with — but I’m more into the 
socio-ethical stuff myself. How about you? 

Are you into physics or 


> NATURE.COM philosophy?” 

Follow Futures on She glanced around, 
Facebook at: spotted me and mut- 
go.nature.com/mtoodm tered something 
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inaudible to Brian. He looked at me, raised 
one eyebrow a millimetre, offered her his 
business card and replied casually: “Sure, 
why not. Oh, when did you say you were 
from?” 

“2158, but I...” 

“Quick! Get the net!” he yelled. 

Reaching under my jacket, I lurched for- 
ward and threw the nylon mist net over the 
girl. Brian followed up with one of his own 
and we danced around the crest-fallen, tan- 
gled figure, whooping and high-fiving each 
other in semi-drunken delight. 

“Guys! What is this? This is REALLY 
uncool!” A tall, gaunt lady with an organiz- 
er’s badge stood in front of us with her hands 
planted firmly on her hips. She did not look 
happy. From the edge of the growing crowd, 
two unsmiling security guards started shoul- 
der-walking their way towards us. 

“She's from the future! From 2158! She 
just admitted it!” I yelled at the increasingly 
restless throng around us. The girl was strug- 
gling under the nets, trying to free herself. 
Some kindly looking folk were attempting 
to help, with limited success. 

“Just think what she knows!” Brian 
interjected. “With what she tells us we 
could shortcut maybe a hundred years of 
research...” 

A lean, hard-eyed Californian had joined 
the organizer lady. “What if she doesn't want 
to tell you anything?” he asked in a meaning- 
ful tone. 

“Why, damn it, we'll MAKE her!” I yelled 
across at him. 

There was a cry of collective horror — 
then the people around us became angrily, 
watchfully quiet. At an unseen signal, the two 
security staff strode forward and pinioned 
our arms with professional efficiency. 

“Sorry friend, that’s not what happens 
here...” 

We were led towards the exit. A knot of 
familiar, determined-looking science folk 
moved with us — led by the lean Californian. 
As this curious circus reached the parking 
area a van with heavily tinted glass pulled up 
to the kerb and the side door powered open. 

As we were bundled unceremoniously 
into the van, the Californian turned to us 
and said quietly: “Did you really think she 
would risk being here alone?” = 


John and Brian keep bumping into each 
other at conferences — the rest is fiction, 
inspired by an original stand-up comedy 
routine of Brian’. 


JACEY 
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Phylogenetic position of Diania challenged 


ARISING FROM J. Liu et al. Nature 470, 526-530 (2011) 


Liu et al.' describe a new and remarkable fossil, Diania cactiformis. 
This animal apparently combined the soft trunk of lobopodians (a 
group including the extant velvet worms in addition to many 
Palaeozoic genera) with the jointed limbs that typify arthropods. 
They go on to promote Diania as the immediate sister group to the 
arthropods, and conjecture that sclerotized and jointed limbs may 
therefore have evolved before articulated trunk tergites in the imme- 
diate arthropod stem. The data published by Liu et al.1 do not un- 
ambiguously support these conclusions; rather, we believe that Diania 
probably belongs within an unresolved clade or paraphyletic grade of 
lobopodians. 

Without taking issue with the interpretation of Diania offered by 
Liu et al.’, or of the manner in which they coded their characters, we 
were nonetheless unable to derive their cladogram optimally from the 
data published. Moreover, we could not replicate their results using 
any other plausible optimality criteria, or by varying additional para- 
meters not specified by the authors. 

Liu et al.’ report analysing their data in PAUP** under maximum 
parsimony and with implied weights’ using k = 2 (a rather arbitrary 
choice), but do not mention any other assumptions (for example, the 
imposition of character order). They obtained three most parsimo- 
nious trees, each of 130 steps. Straightforward replication of their 
stated settings yields 13 trees of just 90 steps each, the strict consensus 
of which is illustrated (Fig. 1). Why such a difference? 

Several of their characters contained inapplicable or gap codings. 
These appear where a ‘daughter’ character is logically contingent 
upon the state of a ‘parent’, and cannot be coded when the parent is 
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Figure 1 | The strict consensus of 13 most parsimonious trees (L = 90) 
obtained from the published data and settings specified by the authors. 


absent. For example, character 6 (position of frontal appendage) can 
only be coded in taxa that possess a frontal appendage (character 5) in 
the first instance (such that a “0” for character 5 necessitates a “-” for 
character 6). In morphological analyses such as this, inapplicable 
states are usually assumed to have no bearing on the analysis, being 
reconstructed passively in the light of known states. In analyses of 
nucleotide data, by contrast, gaps may alternatively be construed as a 
fifth and novel state, because shared deletions from some ancestral 
sequence may actually be informative. If this assumption is made with 
morphological data, however, all the logically uncodable states in a 
character are initially assumed to be homologous, and a legitimate 
basis for recognizing clades. At best, this assigns double weight a priori 
to absences in the ‘parent’ character (because the daughter is always 
contingent), and at worst is positively misleading. This is the approach 
that we believe Liu et al.’ may have taken. Reanalysis of their data using 
“gapmode = newstate’ combined with ‘collapse = MinBrlen’ settings 
in PAUP*? produced some optimal trees of 130 steps. However, we 
were still unable to replicate the relationships shown in their Fig. 4, 
even when varying k between 0 and 10. Rather we either resolved 
Diania in a basal polytomy, or slightly higher in the tree but separated 
from the arthropods by at least five nodes. 

At best, therefore, the position of Diania is highly labile and extre- 
mely sensitive to the precise methods used. We certainly feel that it is 
premature to draw conclusions regarding its supposedly pivotal posi- 
tion in the evolution of arthropods. However, our reanalyses do not 
challenge the more general conclusions of Liu et al.': namely that the 
full complement of arthropod characters were probably acquired 
piecemeal and possibly convergently. Many closely allied groups 
exploited successfully some but not all of the characters that typify 
the arthropod crown group. Only in retrospect do we discern a single, 
ladder-like trajectory through what was really a much more eccent- 
rically branching bush. 
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Lobopodian phylogeny reanalysed 


ARISING FROM J. Liu et al. Nature 470, 526-530 (2011) 


Liu et al.’ described an ‘armoured’ lobopodian, Diania cactiformis, 
from the Chengjiang Lagerstatte (China; Cambrian, stage 3); this fossil 
bears potentially arthropod-like articulated and possibly sclerotized 
appendages, but lacks a sclerotized body. A cladistic analysis resolved 
Diania as sister-taxon to arthropods. From this phylogenetic position 
the authors tentatively inferred that arthropodization (sclerotization of 
limbs) may have preceded arthrodization (sclerotization of body ele- 
ments) in arthropod evolution. Although we concur with the reasoning 
behind this inference, it rests on a phylogenetic placement that our 
analysis of the published data set does not reproduce. 

Our analyses were undertaken using implicit enumeration (branch 
and bound) in TNT (Tree Analysis using New Technology) v.1.1 
(ref. 2), first repeating the analysis of Liu et al.' with implied character 
weighting (k = 2) and additionally with equal character weighting; 
results are shown in Fig. 1b, c. In neither analysis was Diania resolved 
as sister-taxon to Arthropoda, instead belonging to an unresolved 
polytomy at the base of the resultant most-parsimonious trees. To 
confirm that the trees of Liu et al.’ are less parsimonious, a second pair 
of analyses was undertaken. Here tree topologies were constrained to 
the published strict-consensus topology (Fig. 1a), again using implied 
(k = 2) and equal weighting schemes. In both cases a single tree was 
found, substantially longer than those our first analyses recovered 
(16.4 as opposed to 12.4 steps under implied weighting; 119 as 
opposed to 89 steps under equal weighting). This demonstrates that 
the published topology is far from the most parsimonious that can be 
obtained from the data set. The use of a branch and bound (rather 
than heuristic) search indicates that this discrepancy is not a result of 
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Figure 1 | Phylogeny of Cambian lobopodians and stem-group arthropods. 
a, Topology proposed by Liu et al.' (implied weights, k = 2, 16.4 steps, 
consistency index (CI) = 0.336, retention index (RI) = 0.546; equal weights, 
119 steps, CI = 0.336, RI = 0.546). b, Strict consensus of five most 
parsimonious trees using the methods of Liu et al.' (implied weights, k = 2, 12.4 


E2 | NATURE | VOL 476 | 11 AUGUST 2011 


chance failure to hit upon the best solution; instead we believe that it 
reflects a methodological error in the original analysis. 

The phylogeny recovered here suggests a polyphyletic origin of 
arthropodized limbs, where the arthropodization in Diania was acquired 
separately to that of the crown-group arthropods. It does not rule out the 
‘leg-first’ (as opposed to arthrodization-first) scenario posited by Liu et 
al.', but provides no support for it either. The recovered polychotomy 
demonstrates that the character coverage of Liu et al.’ is inadequate to 
resolve the interrelationships of lobopodians; a more comprehensive 
data set is required to assess the position and significance of Diania 
properly. We note, however, that our results from the Liu et al.’ data 
set (Fig. 1b) place the euarthropods within the dinocaridids (radiodonts 
and related taxa), and as such are similar to other analyses° * which have 
used more character- and taxon-rich data sets. The origin of arthropo- 
dized trunk limbs is problematical in this topology, as dinocaridids 
apparently lack any form of leg-like trunk appendage’. The increasingly 
detailed fossil record of stem-group euarthropods provides our best 
chance of resolving this issue, but as yet has failed to do so; unequivocal 
evidence for any particular ordering of acquisition in these characters is 
not yet available. Diania is a fascinating animal, but in its revised position 
it contributes little to this debate. 
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steps, Cl = 0.444, RI = 0.713). ¢ Strict consensus of 86 trees produced with 
equal character weighting (89 steps, CI = 0.499, RI = 0.718). Numbers 
represent important characters of Liu et al.' relating to arthropodization and 
arthrodization. 
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REPLYING TO R. C. P. Mounce & M. Wills Nature 476, doi:10.1038/nature10266 (2011); D. A. Legg et al. Nature 476, doi:10.1038/naturel10267 


(2011); 


We welcome the reanalyses by Mounce and Wills' and Legg et al.’ of 
our paper’, and although we do not fully concur with their conclu- 
sions we are pleased that Diania has reopened the debate about key 
stages in arthropod evolution. We accept that the position of this fossil 
remains sensitive to parameters of analysis and in the original pub- 
lication we conceded that our best-supported tree—Diania as sister- 
group to (Schinderhannes + Euarthropoda)—could be subject to 
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change, and that the ‘walking cactus’ may have a more basal position 
within the overall framework of the arthropod stem-group. These 
alternative treatments of our data would seem to confirm this sus- 
picion, although we find the placement of Diania in an unresolved, 
and extremely basal, polytomy alongside velvet worms, tardigrades 
and various other lobopodians similarly problematical. We do not 
doubt that the authors’ results'” are statistically well supported, but 
what do these cladograms tell us about the evolution of the group? 
Lobopodians are, by their nature, fairly simple and consequently yield 
few convincing synapomorphies, either with each other or with 
arthropods in general. As we discovered, this makes scoring a robust 
data matrix including both lobopodians and arthropods challenging, 
and we wonder whether the basal polytomies recovered here are 
simply due to clustering among taxa with few unequivocal apomor- 
phies and/or much missing data. 

Our original placement of Diania close to the euarthropods was 
strongly influenced by the character of jointed trunk appendages. We 
acknowledge an error in scoring characters 17 and 27 of Tardigrada, 
which should have been 0 and 1, not 1 and 0, respectively. We also accept 
the criticism by Mounce and Wills’ that parent-daughter characters are, 
to some extent, dependent upon one another and can artificially inflate 
support for particular clades. This situation is hard to avoid when select- 
ing characters across a range of lobopod/arthropod fossils, which 
include taxa with unusual morphologies (for example, Opabinia) or 
where there are alternative hypotheses for the homology of a given 
lobopod feature and its probable arthropod equivalent: should a lobo- 
pod/dinocaridid flap be scored the same way as a euarthropod limb 


Figure 1 | Reanalysis of our data using PAUP: bootstrap 50% majority-rule 
consensus tree. Hierarchical structure in the data was assessed using the PTP 
test® as implemented in PAUP. Parsimonious trees were found through a 
heuristic search strategy using tree bisection-reconnection (TBR) branch 
swapping. In an initial analysis, all characters were treated as unordered. 
Branch support was based on calculating bootstrap values (10,000 replicates 
and with rearrangement limit of 10,000,000 rearrangements per additional 
sequence). A significant value in the PTP test (P = 0.01) does suggest the 
presence of a phylogenetic signal in the morphological data, supporting the 
given topology. Thirty-eight characters were parsimony-informative. The 
heuristic search for a maximum parsimony solution resulted in only one 
parsimonious tree (length = 121, homoplasy index (HI) = 0.67, consistency 
index (CI) = 0.33, retention index (RI) = 0.55, excluding 1 uninformative 
character). This tree contains 26 resolved nodes, of which many were strongly 
supported by bootstrap values (100+) and in some nodes bootstrap values 
showed even greater resolution. 
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exite; could both be subsumed under a ‘biramous limb’, etc.? 
Furthermore, a disadvantage of PAUP is that ‘gaps’ are automatically 
treated as missing in the analysis, and the only way to distinguish 
between them is to treat the gap as a new state; although a gap in our 
analysis refers to an inapplicable character, which is itself not a new state. 

In our initial analysis data were entered into a matrix using 
MacClade version 3.05 (ref. 4). Analyses were performed with PAUP 
version 4.0b10 (ref. 5), whereby multi-state characters were treated as 
‘unordered’, other characters were treated as ‘ordered’, and a branch 
and bound search under implied weights (k = 2) produced many most 
parsimonious trees—including trees similar to those of Legg et al.’ and 
Mounce and Wills’. For the purpose of character state mapping, we 
selected one of these most parsimonious trees (see Fig. 4 in ref. 3) which 
we felt best reflected the complex relationships among lobopodians 
and arthropods. Also, a constrained analysis was conducted, and three 
monophyletic groups (lobopodians + Onychophora, Radiodonta, 
Arthropoda) were enforced. To verify the stability of this tree, we 
conducted bootstrap analysis and the result (Fig. 1) shows that the 
contained 26 resolved nodes of this tree were also strongly supported 
by bootstrap values (100+) and in some nodes bootstrap values 
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Figure 2 | Tree shows the Bremer support value, which strongly supports 
Diania being sister group to Arthropoda (Bremer index = 4). Four 
additional steps would be required to disrupt the Diania-arthropod 
relationship. As we noted in the original description, Diania is the most 
arthropod-like of the lobopodians recorded so far—at least in terms of limb 
morphology—and we feel that it is better placed on the arthropod stem, rather 
than in an unresolved polytomy. 
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showed more resolution. Additionally, a significant value of the parti- 
tioning tail permutation (PTP) test (P = 0.01) suggests the presence of 
a clear phylogenetic signal in the morphological data, also strongly 
supporting the topology shown. Furthermore, a test of Bremer support 
still supports our initial analysis (Fig. 2). 

Legg et al.” drew attention to the absence of sclerotized trunk limbs in 
dinocaridids (Anomalocaris, etc.), which remains for us one of the great 
puzzles of stem-group arthropod evolution. Put simply, dinocaridids 
have a more arthropod-like head region (cephalisation, eyes, sclerotized 
mouthparts), whereas Diania lacks such sophistication in the anterior 
body region, but has jointed trunk appendages. One solution to this 
puzzle would be to assume that dinocaridids also had trunk limbs, but 
lost them secondarily. In this scenario Diania could sit comfortably as 
sister group to a (Dinocaridida + Arthropoda) clade: all three sharing 
jointed appendages, the latter two sharing cephalisation, etc. 

We accept that the tree of Legg et al.” and Mounce and Wills’ is the 
strict consensus tree, but we feel that the strict tree is, in this case, too 
conservative to provide meaningful information about the early 
evolution of the arthropods. This was undoubtedly a complex process, 
and may have involved numerous parallel developments introducing 
homoplasy. Under these circumstances, the comb-like most parsimo- 
nious tree is unhelpful and exemplifies a disadvantage of parsimony in 
this instance when faced with a complex early radiation. Indeed, Legg 
et al.’ recognized that the strict consensus tree(s) “suggests a poly- 
phyletic origin of arthropodized limbs”. If true, this would be signifi- 
cant. In this context, Mounce and Wills’ seem to have overlooked the 
potential significance of their reanalysis of our data. Much evidence 
has been accumulated that Arthropoda is monophyletic, but the 
logical conclusion of both reanalyses is that jointed legs are homo- 
plastic, at least within the lobopodian-arthropod assemblage. The 
implications of this are not trivial, bringing us close to the (largely 
discredited) Manton school of thought in which jointed appendages 
were proposed to have evolved in parallel in different (here euarthro- 
pod) lineages. At a fundamental level we need to know how easy it is to 
turn a soft, lobopodian limb into a sclerotized arthropod one, and 
whether this happened once, twice or on multiple occasions. For this 
reason alone, we believe that Diania is not merely fascinating, but 
remains invaluable to the evolutionary debate by challenging our 
notion of what it means to be an arthropod. 
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Phylogenetic position of Diania challenged 


ARISING FROM J. Liu et al. Nature 470, 526-530 (2011) 


Liu et al.' describe a new and remarkable fossil, Diania cactiformis. 
This animal apparently combined the soft trunk of lobopodians (a 
group including the extant velvet worms in addition to many 
Palaeozoic genera) with the jointed limbs that typify arthropods. 
They go on to promote Diania as the immediate sister group to the 
arthropods, and conjecture that sclerotized and jointed limbs may 
therefore have evolved before articulated trunk tergites in the imme- 
diate arthropod stem. The data published by Liu et al.1 do not un- 
ambiguously support these conclusions; rather, we believe that Diania 
probably belongs within an unresolved clade or paraphyletic grade of 
lobopodians. 

Without taking issue with the interpretation of Diania offered by 
Liu et al.’, or of the manner in which they coded their characters, we 
were nonetheless unable to derive their cladogram optimally from the 
data published. Moreover, we could not replicate their results using 
any other plausible optimality criteria, or by varying additional para- 
meters not specified by the authors. 

Liu et al.’ report analysing their data in PAUP** under maximum 
parsimony and with implied weights’ using k = 2 (a rather arbitrary 
choice), but do not mention any other assumptions (for example, the 
imposition of character order). They obtained three most parsimo- 
nious trees, each of 130 steps. Straightforward replication of their 
stated settings yields 13 trees of just 90 steps each, the strict consensus 
of which is illustrated (Fig. 1). Why such a difference? 

Several of their characters contained inapplicable or gap codings. 
These appear where a ‘daughter’ character is logically contingent 
upon the state of a ‘parent’, and cannot be coded when the parent is 
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Figure 1 | The strict consensus of 13 most parsimonious trees (L = 90) 
obtained from the published data and settings specified by the authors. 


absent. For example, character 6 (position of frontal appendage) can 
only be coded in taxa that possess a frontal appendage (character 5) in 
the first instance (such that a “0” for character 5 necessitates a “-” for 
character 6). In morphological analyses such as this, inapplicable 
states are usually assumed to have no bearing on the analysis, being 
reconstructed passively in the light of known states. In analyses of 
nucleotide data, by contrast, gaps may alternatively be construed as a 
fifth and novel state, because shared deletions from some ancestral 
sequence may actually be informative. If this assumption is made with 
morphological data, however, all the logically uncodable states in a 
character are initially assumed to be homologous, and a legitimate 
basis for recognizing clades. At best, this assigns double weight a priori 
to absences in the ‘parent’ character (because the daughter is always 
contingent), and at worst is positively misleading. This is the approach 
that we believe Liu et al.’ may have taken. Reanalysis of their data using 
“gapmode = newstate’ combined with ‘collapse = MinBrlen’ settings 
in PAUP*? produced some optimal trees of 130 steps. However, we 
were still unable to replicate the relationships shown in their Fig. 4, 
even when varying k between 0 and 10. Rather we either resolved 
Diania in a basal polytomy, or slightly higher in the tree but separated 
from the arthropods by at least five nodes. 

At best, therefore, the position of Diania is highly labile and extre- 
mely sensitive to the precise methods used. We certainly feel that it is 
premature to draw conclusions regarding its supposedly pivotal posi- 
tion in the evolution of arthropods. However, our reanalyses do not 
challenge the more general conclusions of Liu et al.': namely that the 
full complement of arthropod characters were probably acquired 
piecemeal and possibly convergently. Many closely allied groups 
exploited successfully some but not all of the characters that typify 
the arthropod crown group. Only in retrospect do we discern a single, 
ladder-like trajectory through what was really a much more eccent- 
rically branching bush. 
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Lobopodian phylogeny reanalysed 


ARISING FROM J. Liu et al. Nature 470, 526-530 (2011) 


Liu et al.’ described an ‘armoured’ lobopodian, Diania cactiformis, 
from the Chengjiang Lagerstatte (China; Cambrian, stage 3); this fossil 
bears potentially arthropod-like articulated and possibly sclerotized 
appendages, but lacks a sclerotized body. A cladistic analysis resolved 
Diania as sister-taxon to arthropods. From this phylogenetic position 
the authors tentatively inferred that arthropodization (sclerotization of 
limbs) may have preceded arthrodization (sclerotization of body ele- 
ments) in arthropod evolution. Although we concur with the reasoning 
behind this inference, it rests on a phylogenetic placement that our 
analysis of the published data set does not reproduce. 

Our analyses were undertaken using implicit enumeration (branch 
and bound) in TNT (Tree Analysis using New Technology) v.1.1 
(ref. 2), first repeating the analysis of Liu et al.' with implied character 
weighting (k = 2) and additionally with equal character weighting; 
results are shown in Fig. 1b, c. In neither analysis was Diania resolved 
as sister-taxon to Arthropoda, instead belonging to an unresolved 
polytomy at the base of the resultant most-parsimonious trees. To 
confirm that the trees of Liu et al.’ are less parsimonious, a second pair 
of analyses was undertaken. Here tree topologies were constrained to 
the published strict-consensus topology (Fig. 1a), again using implied 
(k = 2) and equal weighting schemes. In both cases a single tree was 
found, substantially longer than those our first analyses recovered 
(16.4 as opposed to 12.4 steps under implied weighting; 119 as 
opposed to 89 steps under equal weighting). This demonstrates that 
the published topology is far from the most parsimonious that can be 
obtained from the data set. The use of a branch and bound (rather 
than heuristic) search indicates that this discrepancy is not a result of 
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Figure 1 | Phylogeny of Cambian lobopodians and stem-group arthropods. 
a, Topology proposed by Liu et al.' (implied weights, k = 2, 16.4 steps, 
consistency index (CI) = 0.336, retention index (RI) = 0.546; equal weights, 
119 steps, CI = 0.336, RI = 0.546). b, Strict consensus of five most 
parsimonious trees using the methods of Liu et al.' (implied weights, k = 2, 12.4 
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chance failure to hit upon the best solution; instead we believe that it 
reflects a methodological error in the original analysis. 

The phylogeny recovered here suggests a polyphyletic origin of 
arthropodized limbs, where the arthropodization in Diania was acquired 
separately to that of the crown-group arthropods. It does not rule out the 
‘leg-first’ (as opposed to arthrodization-first) scenario posited by Liu et 
al.', but provides no support for it either. The recovered polychotomy 
demonstrates that the character coverage of Liu et al.’ is inadequate to 
resolve the interrelationships of lobopodians; a more comprehensive 
data set is required to assess the position and significance of Diania 
properly. We note, however, that our results from the Liu et al.’ data 
set (Fig. 1b) place the euarthropods within the dinocaridids (radiodonts 
and related taxa), and as such are similar to other analyses° * which have 
used more character- and taxon-rich data sets. The origin of arthropo- 
dized trunk limbs is problematical in this topology, as dinocaridids 
apparently lack any form of leg-like trunk appendage’. The increasingly 
detailed fossil record of stem-group euarthropods provides our best 
chance of resolving this issue, but as yet has failed to do so; unequivocal 
evidence for any particular ordering of acquisition in these characters is 
not yet available. Diania is a fascinating animal, but in its revised position 
it contributes little to this debate. 
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REPLYING TO R. C. P. Mounce & M. Wills Nature 476, doi:10.1038/nature10266 (2011); D. A. Legg et al. Nature 476, doi:10.1038/naturel10267 


(2011); 


We welcome the reanalyses by Mounce and Wills' and Legg et al.’ of 
our paper’, and although we do not fully concur with their conclu- 
sions we are pleased that Diania has reopened the debate about key 
stages in arthropod evolution. We accept that the position of this fossil 
remains sensitive to parameters of analysis and in the original pub- 
lication we conceded that our best-supported tree—Diania as sister- 
group to (Schinderhannes + Euarthropoda)—could be subject to 
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change, and that the ‘walking cactus’ may have a more basal position 
within the overall framework of the arthropod stem-group. These 
alternative treatments of our data would seem to confirm this sus- 
picion, although we find the placement of Diania in an unresolved, 
and extremely basal, polytomy alongside velvet worms, tardigrades 
and various other lobopodians similarly problematical. We do not 
doubt that the authors’ results'” are statistically well supported, but 
what do these cladograms tell us about the evolution of the group? 
Lobopodians are, by their nature, fairly simple and consequently yield 
few convincing synapomorphies, either with each other or with 
arthropods in general. As we discovered, this makes scoring a robust 
data matrix including both lobopodians and arthropods challenging, 
and we wonder whether the basal polytomies recovered here are 
simply due to clustering among taxa with few unequivocal apomor- 
phies and/or much missing data. 

Our original placement of Diania close to the euarthropods was 
strongly influenced by the character of jointed trunk appendages. We 
acknowledge an error in scoring characters 17 and 27 of Tardigrada, 
which should have been 0 and 1, not 1 and 0, respectively. We also accept 
the criticism by Mounce and Wills’ that parent-daughter characters are, 
to some extent, dependent upon one another and can artificially inflate 
support for particular clades. This situation is hard to avoid when select- 
ing characters across a range of lobopod/arthropod fossils, which 
include taxa with unusual morphologies (for example, Opabinia) or 
where there are alternative hypotheses for the homology of a given 
lobopod feature and its probable arthropod equivalent: should a lobo- 
pod/dinocaridid flap be scored the same way as a euarthropod limb 


Figure 1 | Reanalysis of our data using PAUP: bootstrap 50% majority-rule 
consensus tree. Hierarchical structure in the data was assessed using the PTP 
test® as implemented in PAUP. Parsimonious trees were found through a 
heuristic search strategy using tree bisection-reconnection (TBR) branch 
swapping. In an initial analysis, all characters were treated as unordered. 
Branch support was based on calculating bootstrap values (10,000 replicates 
and with rearrangement limit of 10,000,000 rearrangements per additional 
sequence). A significant value in the PTP test (P = 0.01) does suggest the 
presence of a phylogenetic signal in the morphological data, supporting the 
given topology. Thirty-eight characters were parsimony-informative. The 
heuristic search for a maximum parsimony solution resulted in only one 
parsimonious tree (length = 121, homoplasy index (HI) = 0.67, consistency 
index (CI) = 0.33, retention index (RI) = 0.55, excluding 1 uninformative 
character). This tree contains 26 resolved nodes, of which many were strongly 
supported by bootstrap values (100+) and in some nodes bootstrap values 
showed even greater resolution. 
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exite; could both be subsumed under a ‘biramous limb’, etc.? 
Furthermore, a disadvantage of PAUP is that ‘gaps’ are automatically 
treated as missing in the analysis, and the only way to distinguish 
between them is to treat the gap as a new state; although a gap in our 
analysis refers to an inapplicable character, which is itself not a new state. 

In our initial analysis data were entered into a matrix using 
MacClade version 3.05 (ref. 4). Analyses were performed with PAUP 
version 4.0b10 (ref. 5), whereby multi-state characters were treated as 
‘unordered’, other characters were treated as ‘ordered’, and a branch 
and bound search under implied weights (k = 2) produced many most 
parsimonious trees—including trees similar to those of Legg et al.’ and 
Mounce and Wills’. For the purpose of character state mapping, we 
selected one of these most parsimonious trees (see Fig. 4 in ref. 3) which 
we felt best reflected the complex relationships among lobopodians 
and arthropods. Also, a constrained analysis was conducted, and three 
monophyletic groups (lobopodians + Onychophora, Radiodonta, 
Arthropoda) were enforced. To verify the stability of this tree, we 
conducted bootstrap analysis and the result (Fig. 1) shows that the 
contained 26 resolved nodes of this tree were also strongly supported 
by bootstrap values (100+) and in some nodes bootstrap values 
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Figure 2 | Tree shows the Bremer support value, which strongly supports 
Diania being sister group to Arthropoda (Bremer index = 4). Four 
additional steps would be required to disrupt the Diania-arthropod 
relationship. As we noted in the original description, Diania is the most 
arthropod-like of the lobopodians recorded so far—at least in terms of limb 
morphology—and we feel that it is better placed on the arthropod stem, rather 
than in an unresolved polytomy. 
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showed more resolution. Additionally, a significant value of the parti- 
tioning tail permutation (PTP) test (P = 0.01) suggests the presence of 
a clear phylogenetic signal in the morphological data, also strongly 
supporting the topology shown. Furthermore, a test of Bremer support 
still supports our initial analysis (Fig. 2). 

Legg et al.” drew attention to the absence of sclerotized trunk limbs in 
dinocaridids (Anomalocaris, etc.), which remains for us one of the great 
puzzles of stem-group arthropod evolution. Put simply, dinocaridids 
have a more arthropod-like head region (cephalisation, eyes, sclerotized 
mouthparts), whereas Diania lacks such sophistication in the anterior 
body region, but has jointed trunk appendages. One solution to this 
puzzle would be to assume that dinocaridids also had trunk limbs, but 
lost them secondarily. In this scenario Diania could sit comfortably as 
sister group to a (Dinocaridida + Arthropoda) clade: all three sharing 
jointed appendages, the latter two sharing cephalisation, etc. 

We accept that the tree of Legg et al.” and Mounce and Wills’ is the 
strict consensus tree, but we feel that the strict tree is, in this case, too 
conservative to provide meaningful information about the early 
evolution of the arthropods. This was undoubtedly a complex process, 
and may have involved numerous parallel developments introducing 
homoplasy. Under these circumstances, the comb-like most parsimo- 
nious tree is unhelpful and exemplifies a disadvantage of parsimony in 
this instance when faced with a complex early radiation. Indeed, Legg 
et al.’ recognized that the strict consensus tree(s) “suggests a poly- 
phyletic origin of arthropodized limbs”. If true, this would be signifi- 
cant. In this context, Mounce and Wills’ seem to have overlooked the 
potential significance of their reanalysis of our data. Much evidence 
has been accumulated that Arthropoda is monophyletic, but the 
logical conclusion of both reanalyses is that jointed legs are homo- 
plastic, at least within the lobopodian-arthropod assemblage. The 
implications of this are not trivial, bringing us close to the (largely 
discredited) Manton school of thought in which jointed appendages 
were proposed to have evolved in parallel in different (here euarthro- 
pod) lineages. At a fundamental level we need to know how easy it is to 
turn a soft, lobopodian limb into a sclerotized arthropod one, and 
whether this happened once, twice or on multiple occasions. For this 
reason alone, we believe that Diania is not merely fascinating, but 
remains invaluable to the evolutionary debate by challenging our 
notion of what it means to be an arthropod. 
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High plant diversity is needed to maintain ecosystem 


services 


Forest Isbell!, Vincent Calcagno', Andy Hector’, John Connolly’, W. Stanley Harpole’, Peter B. Reich®’®, Michael Scherer-Lorenzen’, 
Bernhard Schmid’, David Tilman’, Jasper van Ruijven”, Alexandra Weigelt’’, Brian J. Wilsey*, Erika S. Zavaleta'’ & Michel Loreau! 


Biodiversity is rapidly declining worldwide’, and there is consensus 
that this can decrease ecosystem functioning and services”’. It 
remains unclear, though, whether few® or many’ of the species in 
an ecosystem are needed to sustain the provisioning of ecosystem 
services. It has been hypothesized that most species would promote 
ecosystem services if many times, places, functions and environ- 
mental changes were considered’; however, no previous study has 
considered all of these factors together. Here we show that 84% of 
the 147 grassland plant species studied in 17 biodiversity experi- 
ments promoted ecosystem functioning at least once. Different 
species promoted ecosystem functioning during different years, 
at different places, for different functions and under different 
environmental change scenarios. Furthermore, the species needed 
to provide one function during multiple years were not the same as 
those needed to provide multiple functions within one year. Our 
results indicate that even more species will be needed to maintain 
ecosystem functioning and services than previously suggested by 
studies that have either (1) considered only the number of species 
needed to promote one function under one set of environmental 
conditions, or (2) separately considered the importance of bio- 
diversity for providing ecosystem functioning across multiple 
years’”"'*, places’*'*, functions'*’”'* or environmental change 
scenarios'~’”*’. Therefore, although species may appear functionally 
redundant when one function is considered under one set of 
environmental conditions’, many species are needed to maintain 
multiple functions at multiple times and places in a changing 
world. 

Arguments for biodiversity conservation are often based on eco- 
system services, but it remains unclear whether few* or many’ species 
are needed to maintain ecosystem services. Determining how many 
species provide ecosystem services will require a synthesis of several 
areas of biodiversity research (Fig. 1). Biodiversity-ecosystem func- 
tioning studies have often considered a single functional context and 
found that multiple, but not all, study species promoted ecosystem 
functioning®’ (Fig. 1a). We define a functional context (henceforth 
context) as the measurement of one function, at one time and place, 
under one environmental change scenario. Several related biodiversity 
studies have explored whether more species promote ecosystem func- 
tioning when more than one context is considered. For example, bio- 
diversity-ecosystem stability (that is, the invariability of productivity) 
studies have found that more species are needed to provide ecosystem 
functioning at larger spatio-temporal scales because different species 
promote productivity at different times’”* (Fig. 1b) or places’*"®. 
Biodiversity-ecosystem multifunctionality studies have found that 
more species are needed to provide multiple functions because differ- 
ent species promote different functions'*'”’* (Fig. 1c). Biodiversity- 
global change studies have found that more species are needed to 


provide ecosystem functioning in a changing world because different 
species promote ecosystem functioning under different environmental 
change scenarios’. Here, for the first time to our knowledge, we con- 
sider all of these relationships together. 

We included data from 17 grassland biodiversity experiments that 
considered multiple times, places, functions or environmental change 
scenarios (Supplementary Table 1 and Supplementary Data). To test 
whether different species promoted ecosystem functioning during dif- 
ferent years, we included studies that planted replicate plots (same 
species compositions) during consecutive years** or made repeated 
measurements of ecosystem functions across years'”!”?!???*°, To test 
whether different species promoted ecosystem functioning at different 
places, we included studies that planted replicate plots (same species 
compositions, with one exception’) at multiple sites across Europe*® 
or multiple spatial blocks within a site'””*”°. To test whether different 
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Figure 1 | Some of the ways that biodiversity can be important for 
ecosystem functioning. Each of the eight symbols represents a species. Species 
shown in the bivariate plots are those that promoted ecosystem functioning 
within each functional context (for example, context Al might be above- 
ground biomass measured during 2001). Although this figure defines contexts 
in two dimensions for simplicity, we considered four dimensions (Fig. 3). 
Previous studies have considered (a) one context, or (b, c) one dimension of 
contexts (b, ecosystem stability studies; c, ecosystem multifunctionality 
studies). Figure 2 tests the question in d, by comparing the one-dimensional 
overlap (for example, between Al and A2) with the two-dimensional overlap 
(for example, between Al and B2) and three-dimensional overlap (that is, a pair 
of contexts that differs in three ways; not shown). Our results reject each of the 
null hypotheses shown on the left in a-d. 
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species promoted different functions, we included studies that mea- 
sured several functions'*'””’, such as biomass production and nutrient 
uptake. Many of these functions are considered to be supporting eco- 
system services because other types of ecosystem services depend on 
them*”’. To test whether different species promoted ecosystem func- 
tioning under different environmental change scenarios, we included 
studies that applied environmental change treatments, such as nutrient 
and CO) enrichment”, precipitation changes”! or land use changes 
such as livestock grazing” and haying”. 

We began by identifying the sets of study species that influenced 
ecosystem functioning within each context. Species were considered to 
promote ecosystem functioning in a particular context if they had 
effects in the direction that would usually be considered desirable from 
an ecosystem services perspective’’. Positive effects were considered 
desirable for all functions except for soil inorganic nitrogen and light 
availability at ground level, where negative effects are consistent with 
lower levels of unconsumed resources’. We did not use separate defi- 
nitions of desirable effects for different species (for example, positive 
effects of legumes on soil nitrogen might be considered desirable) to be 
consistent with previous studies'”'’, to be conservative and because it 
may not be possible to manage simultaneously for both positive and 
negative effects. We found that approximately 27% of the study species 
promoted ecosystem functioning within any particular context, 
regardless of the size of the study species pool (Fig. 2a). Note that if 
many species were functionally redundant, or if only the most com- 
mon species promoted ecosystem functioning, then we would expect a 
saturating relationship in Fig. 2a. Instead, our results suggest that even 
rare species can promote ecosystem functioning. 

After identifying the sets of species that promoted ecosystem func- 
tioning in each context, we tested whether different sets of species 
promoted ecosystem functioning in different contexts. We used 
Sorensen’s similarity index to quantify overlap between species sets’”. 
All comparisons were made within studies so that differences between 
pairs of contexts were not due to sampling from multiple species pools. 
First, we quantified one-dimensional overlap between all pairs of con- 
texts that differed in only one way (Fig. 1b, c). For example, multi-year 
overlap was quantified between each pair of contexts that differed only 
in which year ecosystem functioning was measured (that is, same 
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Figure 2 | Sets of study species that promoted ecosystem functioning. a, The 
mean number of species that promoted ecosystem functioning within each 
context increased linearly (f = 16.40, P< 0.001, R= 0.944) with the size of the 
species pool, such that approximately 27% (mean, 95% confidence intervals for 
slope: 0.27, 0.24-0.30) of the study species promoted ecosystem functioning 
within each context. Error bars for each study indicate 95% generalized linear 
model confidence intervals. b, Different sets of species promoted ecosystem 
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place, function and environmental change scenario in both contexts). 
A multi-year overlap value of one or zero would respectively indicate 
that completely identical or completely unique sets of species pro- 
moted ecosystem functioning during different years, independent of 
the other sources of variation. We found overlap values between these 
two extremes, which indicates that somewhat different sets of species 
promoted ecosystem functioning during different years, at different 
places, for different functions and under different environmental 
change scenarios (Fig. 2b). 

After considering these sources of variation independently, we 
quantified multi-dimensional overlap between pairs of contexts that 
differed in two or three ways (Fig. 1d). Again, all comparisons were 
made within studies. We found that the average overlap between pairs 
of contexts decreased as the number of differences between contexts 
increased (Fig. 2b and Supplementary Fig. 1). This means that, for 
example, the identities of the additional species needed to provide 
one function during multiple years were not the same as the identities 
of the additional species needed to provide multiple functions during 
one year (Fig. 1d). Additionally, species sets did not simply vary inde- 
pendently of context attributes (permutation test P< 0.001 for one-, 
two- and three-dimensional overlap) (Fig. 2b). Thus, our results indi- 
cate that even more species will be needed to maintain ecosystem 
functioning and services than previously suggested by studies that have 
either (1) considered only the number of species needed to promote one 
function under one set of environmental conditions, or (2) separately 
considered the importance of biodiversity for providing ecosystem 
functioning across multiple years'’*, places’*'®, functions’*’”"* or 
environmental change scenarios’*"”-”. Future studies could more 
completely consider the consequences of biodiversity declines for eco- 
system functioning and services by similarly considering the multi- 
dimensionality of ecosystem functioning both in experimental and 
natural communities. 

Next, we quantified the extent to which the number of species 
promoting ecosystem functioning increased as more years, places, 
functions or environmental change scenarios were considered within 
each study. In other words, we quantified the accumulation of species 
across each of the four dimensions of contexts that we considered 
(Fig. 1). We found that a greater proportion of species promoted 
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functioning in different contexts (overlap < 1), and overlap between pairs of 
contexts decreased as the number of differences between contexts increased 
(see Fig. 1). Symbols indicate means for each specific type of overlap; horizontal 
dotted lines show + 95% permutation test confidence intervals; error bars for 
symbols and bars indicate 95% bootstrap confidence intervals. Supplementary 
Data indicates numbers of contexts for each study. 
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Figure 3 | The proportion of study species that promoted ecosystem 
functioning increased when more (a) years, (b) places, (c) ecosystem 
functions and (d) environmental change scenarios were independently 
considered. Solid blue lines indicate generalized linear model fits for each 
study; dashed red line indicates grand mean generalized linear model fitted 


ecosystem functioning when more years, places, functions or environ- 
mental change scenarios were considered (Fig. 3). These relationships 
result from different species promoting ecosystem functioning in dif- 
ferent contexts (Fig. 2b). Note that if the one-dimensional overlap 
values corresponding to each panel in Fig. 3 were one or zero, then 
these relationships would be horizontal or linearly increasing, respec- 
tively'’. Our results are between these two extremes. 

After comparing contexts within studies, data from all studies were 
combined to consider how the total number of species that promoted 
ecosystem functioning increased with the total number of contexts. 
We quantified the number of species that promoted ecosystem func- 
tioning in a random subset of all possible combinations of our 
observed contexts (that is, 100 pairs, 100 groups of three, etc.). The 
large increase in the number of species that promoted ecosystem 
functioning as more contexts were considered (Fig. 4) is the result of 
different species promoting ecosystem functioning during different 
years, at different places, for different functions, under different environ- 
mental change scenarios and in different species pools. Considering all 
of these factors together suggests that many species will be needed to 
maintain ecosystem multifunctionality at large spatio-temporal scales 
in a changing world. Consequently, the extinction (or decreased local 
occurrence) of almost any of these species is expected to decrease 
ecosystem functioning and services in at least some contexts. Further 
study is needed to identify the processes that explain why different 
species promoted ecosystem functioning in different contexts. The 
specific mechanisms involved probably differed across contexts, but 
previous results from these and other biodiversity experiments” sug- 
gest that complementarity (in time, space, functional effect traits and 
functional response traits) is a general explanation for this pattern. 

Our results reveal new opportunities and challenges for prioritizing 
conservation efforts and predicting consequences of biodiversity 
declines. According to the precautionary principle, all species should 
be conserved because we cannot be certain which species actually 
provide ecosystem services”. Our results offer further support for 
the precautionary principle because most of the studied species were 
important at least once, and species exhibited context-dependent 
effects that will be difficult to predict. If it is impossible or impractical 
to conserve all species, then future studies could additionally consider 
how often (Supplementary Fig. 2 and Supplementary Data) and how 
much species influenced ecosystem functioning to determine which 
species are most important for maintaining ecosystem functioning and 
services. This will require careful consideration of many contexts, 
because it is not possible to make general predictions or conclusions 
by considering a few context-dependent phenomena”. 

Future studies could determine whether some species consistently 
promote ecosystem functioning under environmental conditions that 
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across all studies. Box plots summarize observed data: black band, median; 
bottom and top of boxes respectively correspond to lower and upper quartiles; 
error bars, + 1.5 times the interquartile range. See Supplementary Data for the 
specific years, places, functions and environmental change scenarios 
considered in each study. 


are currently common, or under environmental change scenarios that 
will probably become increasingly common. Note that even species 
that have small effects could be important for maintaining ecosystem 
functioning and services if they have a large cumulative desirable effect 
across many contexts. For example, Eriochloa sericea had the smallest 
desirable effect on above-ground biomass in the irrigated plots at the 
MEND Irrigation experiment during 2009, but promoted ecosystem 
functioning in 75% of the contexts in which it was included. Future 
studies could also determine which species promote ecosystem func- 
tioning in particular contexts that are highly valued by stakeholders. 
Note that even species that rarely promote (or often decrease) eco- 
system functioning could be most important for maintaining eco- 
system functioning and services within some contexts. For example, 
although Pascopyrum smithii only promoted ecosystem functioning in 
2% of the contexts in which it was included, it promoted soil carbon 
more than any other species in the Cedar Creek Biodiversity experi- 
ment during 2004. Furthermore, note that declines in local diversity, 
which are far more common than global extinctions, will also decrease 
ecosystem functioning and services within some contexts. Finally, even 
the few species that never promoted ecosystem functioning in these 
studies (Supplementary Fig. 2) could promote ecosystem functioning 
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Figure 4 | The number of study species that promoted ecosystem 
functioning increased with the number of contexts considered across all 
studies. The points are the number of species that promoted ecosystem 
functioning when 1-557 contexts were sampled from all 557 contexts. The 
dashed line indicates the total number of studied species (147), which restricts 
the upper limit for these values. The x axis includes variation across years, 
places, functions, environmental change scenarios and species pools. 
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in other contexts, or be a conservation priority for other (for example, 
ethical, aesthetic) reasons. Therefore, we encourage careful considera- 
tion of many contexts when making conservation decisions and pre- 
dicting the consequences of biodiversity declines. 


METHODS SUMMARY 

Identifying the species that promoted ecosystem functioning. For each context, 
we modelled ecosystem functioning response variables as a function of the pres- 
ence or absence of each study species at the plot level. We used backward- 
elimination multiple regression to identify a minimally adequate model, based 
on the Akaike information criterion (AIC)'’. This procedure was performed with 
the stepAIC function in the MASS package of R 2.11.1 (see Methods). Species were 
considered to promote ecosystem functioning in a context if they were included in 
the minimally adequate model and had effects in the direction that would usually 
be considered desirable from an ecosystem services perspective’’. 

Comparing sets of species between pairs of contexts. We used Sorensen’s 
similarity index to quantify the overlap between the sets of species that promoted 
ecosystem functioning in pairs of contexts within each study'’. This allowed us to 
test whether identical (overlap = 1), unique (overlap = 0) or somewhat different 
(0 < overlap <1) sets of species promoted ecosystem functioning in different 
contexts. We also tested whether overlap decreased as the number of differences 
between contexts increased (Fig. 1). 

Accumulation of species across across contexts. We quantified the accumula- 
tion of species across each of the four dimensions of contexts that we considered 
(Methods; Fig. 1). A quasi-binomial generalized linear model was fitted to deter- 
mine how the proportion of species that promoted ecosystem functioning 
increased with the number of years (or places, functions, environmental changes), 
including ‘study’ as a factor. This allowed us to describe the mean trends within 
and across studies. We also randomly sampled combinations (that is, 100 pairs, 
100 groups of three, and so on) of all 557 contexts to determine how the number of 
species promoting ecosystem functioning increased when all of these factors were 
considered together. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Identifying the species that promoted ecosystem functioning. For each context, 
we modelled ecosystem functioning response variables as a function of the pres- 
ence or absence of each study species at the plot level. We used a backward- 
elimination multiple regression analysis to identify a minimally adequate model, 
based on the AIC’’. This procedure was performed with the stepAIC function in 
the MASS package of R 2.11.1. Specifically, for each context, we started with a full 
model that included a main effect for each study species and an intercept. The 
stepAIC function then removed each species, one at a time, from this full model 
and compared the AIC values of the resulting simpler models with the AIC value of 
the full model. If the AIC value for any of the simpler models was smaller than the 
AIC value for the full model, then the variable whose removal resulted in the 
largest decrease in AIC was permanently removed from the full model. This 
backward-deletion process was repeated until the removal of any species resulted 
in a model with a higher AIC value. The minimally adequate model that resulted 
from this process contained the most parsimonious set of species influencing 
ecosystem functioning. These species were considered to promote ecosystem 
functioning if they had effects in the direction that would usually be considered 
desirable from an ecosystem services perspective’’. Positive effects were considered 
desirable for all functions except for soil inorganic nitrogen and light availability at 
ground level, where negative effects are consistent with lower levels of unconsumed 
resources’’. We did not use separate definitions of desirable effects for different 
species (for example, positive effects of legumes on soil nitrogen might be con- 
sidered desirable) to be consistent with previous studies'””*, to be conservative and 
because it may not be possible to manage simultaneously for both positive and 
negative effects. 

This modelling approach is conservative in several ways. Previous approaches for 
determining the number of species that promoted ecosystem functioning could have 
been biased by not allowing species to decrease ecosystem functioning"* or by not 
identifying which species actually influenced ecosystem functioning"*. Furthermore, 
including species interactions in these or other models would probably increase the 
proportion of study species that promoted ecosystem functioning within each con- 
text'*’®, For example, some species perform poorly in monocultures, but interact 
positively with other species in mixtures (for example, grass—legume interactions)*’. 
Our approach would underestimate the desirable effects of these species. Also, the 
presence or absence model that we used would be especially conservative for experi- 
mental designs that concentrate on varying evenness rather than richness, such as 
the simplex design used in the Agrodiversity study*®. Additionally, our results were 
qualitatively similar, but less conservative, when we used a model-averaging 
approach implemented in the glmulti package (version 0.6-3) of R®. 

To compare the number of species that promoted ecosystem functioning across 

studies, we used a quasi-Poisson generalized linear model with ‘study’ as a main 
effect. This procedure was performed with the generalized linear model function in 
the stats package of R. The quasi-maximum-likelihood version of the Poisson 
generalized linear model accounts for over- or under-dispersion in the data. We 
used the confint.glm function in the MASS package of R to obtain the 95% 
confidence intervals. To quantify the proportion of study species that promoted 
ecosystem functioning, we regressed the mean number of species that promoted 
ecosystem functioning per context on the number of study species (that is, the 
number of planted species that were present at least once in biomass samples) with 
no intercept. 
Comparing sets of species between pairs of contexts. After identifying the sets of 
species that promoted ecosystem functioning in each context, we quantified the 
overlap between these sets of species to test whether different sets of species 
promoted ecosystem functioning in different contexts. All overlap comparisons 
were made within studies. Overlap between contexts a and b was quantified by 
Sorensen’s similarity index’: 


ye lat EsI 
0.5(|Ea| +|Eo|) 


where |E,| is the number of species that promoted ecosystem functioning in 
context a and |E,(\E;| is the number of species that promoted ecosystem func- 
tioning in both contexts. First, one-dimensional overlap was quantified between 
pairs of contexts that only differed in one way (Fig. 1b, c). For example, multi-year 
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overlap was quantified between each pair of contexts that differed only in which 
year ecosystem functioning was measured (that is, place, function and environ- 
mental change scenarios were the same in both contexts). This allowed us to test 
whether identical (overlap = 1), unique (overlap =0) or somewhat different 
(0 < overlap <1) sets of species promoted ecosystem functioning in different 
contexts. Next, multi-dimensional overlap was quantified between pairs of con- 
texts that differed in two or three ways (Fig. 1d). This allowed us to test whether 
overlap decreased as the number of differences between contexts increased (Fig. 1). 

For each type of overlap, we used non-parametric bootstrap, with correction for 
bias****, to build 95% confidence intervals. We re-sampled the observed contexts 
with replacement to generate each of 1,000 bootstrap data sets. For each of these 
bootstrap data sets, we computed the average overlap across all pairs of contexts 
that were relevant (that is, for the particular type of overlap being considered). We 
used this bootstrap approach to control for the non-independence of pairwise 
comparisons that had one context in common. We then used an exact permuta- 
tion approach™ to test whether the observed overlap values differed from the 
expected null value (the null hypothesis being that overlap varied across contexts 
independent of context attributes: year, place, function and environmental change 
scenario). Within each study, we permuted context data (that is, the sequence of 
ones or zeros indicating whether each species had a desirable effect or not) with 
respect to context attributes. For each of 1,000 permutations, we computed the 
average overlap across all pairs of contexts that were relevant (that is, for the 
particular type of overlap being considered). We used the null distribution of 
overlap obtained from these random permutations to test whether the observed 
value was significantly lower (that is, in the lowest 2.5 percentiles) or significantly 
higher (that is, in the top 2.5 percentiles) than the null. We used this test because it 
does not assume that study species had independent responses to context attri- 
butes, and thus it should be robust to the presence of functional groups. 

Note that there could be correlations between contexts owing to repeated mea- 

surements across years. These correlations could influence overlap estimates and 
tests that assume independent observations. For example, positive correlations 
between repeated measurements during consecutive years could lead to (1) over- 
estimates of multi-year (Y) overlap, and (2) smaller estimates of overlap between 
two different years at two different places than between two different years at one 
place (that is, Y and places (P) overlap less than Y overlap). These potential 
correlations had little influence on our results because multi-year overlap esti- 
mates were much less than 1, and Y and P overlap estimates were not less than Y 
overlap estimates (Fig. 2). Nevertheless, we encourage future studies to consider 
these correlations when interpreting overlap estimates. 
Accumulation of species across contexts. We quantified the accumulation of 
species across each of the four dimensions of contexts that we considered (Fig. 1). 
For example, above-ground biomass was sampled during 13 years at the Cedar 
Creek Biodiversity experiment. To determine the extent to which more species 
promoted above-ground biomass as more years were considered, we sampled all 
combinations of these 13 contexts (that is, all pairs, groups of three, etc.), and 
recorded the number of unique species that promoted ecosystem functioning, and 
the number of unique years, for each combination. This was repeated for each 
function, at each place, under each environmental change scenario. These results 
are summarized in Fig. 3a. A quasi-binomial generalized linear model was fitted to 
determine how the proportion of species that promoted ecosystem functioning 
increased with the number of years, including ‘study’ as a factor. This allowed us to 
describe the mean trends within and across studies. We also randomly sampled 
combinations (that is, 100 pairs, 100 groups of three, etc.) of all 557 contexts to 
determine how the number of species promoting ecosystem functioning increased 
when all of these factors were considered together. 
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Polar actomyosin contractility destabilizes the 
position of the cytokinetic furrow 


Jakub Sedzinski!?*, Maté Biro’?*, Annelie Oswald!, Jean-Yves Tinevez)7+, Guillaume Salbreux® & Ewa Paluch’? 


Cytokinesis, the physical separation of daughter cells at the end of 
mitosis, requires precise regulation of the mechanical properties of 
the cell periphery’*. Although studies of cytokinetic mechanics 
mostly focus on the equatorial constriction ring’, a contractile 
actomyosin cortex is also present at the poles of dividing cells**. 
Whether polar forces influence cytokinetic cell shape and furrow 
positioning remains an open question. Here we demonstrate that 
the polar cortex makes cytokinesis inherently unstable. We show 
that limited asymmetric polar contractions occur during cytokinesis, 
and that perturbing the polar cortex leads to cell shape oscillations, 
resulting in furrow displacement and aneuploidy. A theoretical 
model based on a competition between cortex turnover and contrac- 
tion dynamics accurately accounts for the oscillations. We further 
propose that membrane blebs, which commonly form at the poles of 
dividing cells’ and whose role in cytokinesis has long been enigmatic, 
stabilize cell shape by acting as valves releasing cortical contractility. 
Our findings reveal an inherent instability in the shape of the divid- 
ing cell and unveil a novel, spindle-independent mechanism ensuring 
the stability of cleavage furrow positioning. 

Cell division is driven by the constriction of an actomyosin ring 
at the cell equator. However, throughout cytokinesis, a substantial 
actomyosin cortex remains in regions outside of the equatorial ring 
(the cell poles), both in cultured cells°, and in vivo’* (Supplementary 
Fig. 1 and Supplementary Movie 1). Mechanical studies indicate that 
the polar cortex regulates cleavage dynamics by generating resistive 
forces that slow furrow ingression®” (Supplementary Model section 1). 
The polar cortex is also able to exert stresses, for example to correct for 
shape perturbations'®. Moreover, the formation of membrane blebs, 
known to be caused by cortical contractions, is commonly observed at 
the poles of dividing cells’ (Supplementary Movie 1). However, how 
polar contractile forces contribute to the overall cytokinetic cell shape 
is poorly understood. 

During asymmetric division, unbalanced contraction of one of the 
poles has been shown to directly generate different-sized daughters in 
Caenorhabditis elegans QR.a neuroblasts'’. In symmetrically dividing 
cells, depletion of the actin-binding proteins anillin'’**, supervillin’, 
profilin or diaphanous", as well as astral microtubule stabilization”, 
result in improper polar contractions, leading to oscillations of the 
furrow position and division failure. It is, however, not known whether 
the poles influence furrow positioning in untreated cells. Reasoning 
that if substantial contractile forces are exerted at the poles, shape 
instabilities might arise (Fig. 1a), we asked whether a precise balance 
of polar forces is necessary for accurate symmetric division and, if so, 
how the cell achieves such a balance. 

We first investigated whether asymmetries in polar forces could be 
observed during cytokinesis. To this aim, we quantified cytoplasm 
movements in dividing HeLa and L929 fibroblasts, as an imbalance 
in polar contractions would result in cytoplasm transfer between the 
poles. We detected cytoplasmic flows of amplitude at least two times 
higher than noise level (estimated using metaphase cells as a reference) 


in 20% of HeLa (n = 20) and 25% of L929 cells (n = 40) (Fig. 1b, ¢, 
Supplementary Fig. 2 and Supplementary Movie 2). In most cases, the 
direction of the flows alternated, resulting in small oscillations. 
Strikingly, observations of a large number of dividing cells revealed 
that in about 2% of HeLa (m = 282) and L929 (n = 526) cells (Sup- 
plementary Table 1), dramatic shape oscillations occurred, where each 
pole fully contracted before reversal of the flow direction, destabilizing 
the equatorial position of the furrow and leading to division failure 
(Supplementary Movie 3). To test whether the shape instabilities were 
driven by cortex contractions, we used Y27632, a Rho-kinase inhibitor 
that, at low doses, decreases myosin activity, and thus cortical tension'®, 
without preventing cytokinesis’’. No cytoplasmic flows could be 
detected during cytokinesis in Y27632-treated cells (Fig. 1b, c and 
Supplementary Fig. 2, n = 10), supporting the view that cortical con- 
tractions drive the observed flows. We then tested the involvement of 
spindle microtubules, which influence polar contractility*®*'. Spindle 
disassembly with nocodazole increased the occurrence of shape 
instabilities (Supplementary Fig. 3, Supplementary Movie 4 and 
Supplementary Table 1), suggesting that the spindle is unlikely to drive 
cytokinetic oscillations. Taken together, our observations suggest that 
actomyosin contractions cause shape instabilities during cytokinesis, 
which can lead to division failure if their amplitude is not limited. 

To assess which factors may trigger shape instabilities, we developed 
a minimal physical model of a cytokinetic cell. The dividing cell with a 
cortex under contractile tension is reminiscent of two connected soap 
bubbles, where Laplace pressure resulting from surface tension is 
known to lead to instabilities’ (Fig. 1d). Similarly, in a dividing cell, 
cortical tension generates hydrostatic pressure at each pole’. However, 
the cell interior is a poro-elastic material”’ that resists deformations 
and could stabilize cell shape against cortex-driven instabilities 
(Fig. 1d). In a first approximation, we assumed that each pole has a 
constant tension T and a linear bulk elasticity K (Supplementary Fig. 4a 
and Supplementary Model section 2), which accounts for the effects of 
cellular elastic components and of substrate adhesions (Supplemen- 
tary Fig. 5, Supplementary Movie 5 and Supplementary Model 
section 5). Previous mechanical models of the polar cortex in cytokinesis 
have focused on symmetric cells®’, while we allowed variations in pole 
sizes. A stability analysis reveals that above a critical value of a stability 
parameter T/RoK (with Ro the radius of one daughter cell), any 
fluctuation in volume or tension destabilizes the symmetric cell shape 
and leads to cytosol flows between the poles (Supplementary Fig. 4b). 
Moreover, if a large enough difference in tension is introduced 
between the two poles, no bipolar solution for the shape of the cell 
can be found (Supplementary Fig. 4c). 

To test whether an imbalance in contractile forces between the poles 
indeed destabilizes cell shape, we artificially introduced such a tension 
imbalance. Using a double-pipette local delivery setup (Fig. 2a), we 
disrupted the cortex at one pole of dividing HeLa cells by short applica- 
tion of the actin-depolymerizing drug cytochalasin D. Local polar cortex 
disruption resulted in the formation of a large bleb and destabilized the 
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shape of the cell, triggering extensive shape oscillations (Fig. 2b and 
Supplementary Movie 6; see Supplementary Table 1). As an alternative 
method for cortex disruption, we used laser ablation’® (Fig. 2c). Polar 
cortex ablation also triggered bleb formation, followed by shape oscilla- 
tions and division failure in a large portion of cytokinetic HeLa (Fig. 2d 
and Supplementary Movie 7, 35% of 20 cells) and L929 cells (50% of 20 
cells). These experiments indicate that introducing a tension imbalance 
between the poles of a cleaving cell can trigger shape instabilities. 

Our physical description also predicts that a global polar tension 
increase favours shape instabilities (Supplementary Fig. 4b). Cytokinetic 
shape oscillations had previously been observed upon depletion of the 
actin-binding scaffold protein anillin'*"'*. Anillin depletion affects the 
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Figure 1 | The shape of the cleaving cell displays inherent shape 
instabilities. a, Polar cortex contraction could destabilize the cleavage furrow 
position. b, Stills from differential interference contrast (DIC) time-lapses of 
cytokinesis in control and Y27632-treated cells. Green and red arrows, 
rightward and leftward particle image velocimetry flow fields respectively; scale 
bar, 10 jim. c, Mean velocity of the flow field projected on the long cell axis as a 
function of time in the control (red circles) and the Y27632-treated (blue 
circles) cells pictured in b. Solid lines, data smoothed by factor 5. d, Left, 
schematic of the experiment demonstrating the instability of two soap bubbles 
of interfacial tension T connected by a pipe. P., outside pressure; P;1, Pi, 
pressures inside the bubbles; R,, R2, bubble radii. Right, schematic of a 
cytokinetic cell. Ri, Ro, radii of the future daughter cells; R,, cleavage furrow 
radius; T, polar cortex tension; Kf(v; — v2), cell elastic response; AP is the 
pressure difference between the poles. 
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cortex globally; it is thus unlikely to lead to a strong inhomogeneity in 
tension between the poles. We analysed cortex distribution along the 
cell contour in HeLa and L929 cells expressing myosin regulatory light 
chain (MRLC)-tandem-dimer red fluorescent protein (tdRFP) and 
Lifeact-green fluorescent protein (GFP) to mark F-actin™*. We found 
that in anillin-depleted cells, the cortex was less enriched at the equator 
than in control cells, suggesting an enhanced polar contractility before 
shape oscillations (Fig. 2e-g, Supplementary Fig. 6 and Supplementary 
Movie 8). Another example of global cortex reinforcement was 
observed after cortex ablation in L929 cells, where oscillations could 
be triggered by ablation in any region of the cortex (Supplementary 
Fig. 7a and Supplementary Movie 9, 55% of 60 cells). Laser ablation in 
these cells led to cycles of cortex disassembly and reassembly, and 
resulted in an enhancement of the polar cortex at the expense of the 
equator (Supplementary Fig. 7). Taken together, these observations 
support the hypothesis that a global polar cortex reinforcement favours 
shape instabilities. 

Our initial theoretical description predicted that the dividing cell 
shape should be unstable; however, it did not predict shape oscilla- 
tions, which we systematically observe upon polar cortex perturbation. 
To gain insight into the oscillation mechanism, we analysed cortex and 
pole size dynamics during oscillations in anillin-depleted and laser- 
ablated cells. The mean fluorescence intensities of cortical Lifeact and 
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Figure 2 | Perturbation of the polar cortex enhances cytokinetic shape 
instabilities. a, Top, the local drug delivery experiment; bottom, fluorescent 
image of the released solution (visualized with Alexa Fluor 568; left, greyscale; 
right, pseudo-colour). b, Montage of a cleaving HeLa cell expressing actin- 
GFP, treated with cytochalasin D (CD) at a pole. 0s, onset of cytochalasin D 
application. Red dotted line, position of the releasing pipette; the red arrowhead 
points to the rupture in the actin cortex. c, Left, the laser ablation experiment; 
right, example of polar cortex ablation in a HeLa cell expressing MRLC-tdRFP 
(red circle, ablation spot). d, Time-lapse of shape oscillations in a cleaving HeLa 
cell ablated at the polar cortex (red circle). 0's, onset of ablation. Arrowhead 
points to the bleb induced by cortex ablation. Scale bars, 10 um. e, Left, 
description of the measurement of cortex distribution along the cell periphery. 
f, Representative plots of the cortical distribution of MRLC-tdREFP in a control 
and an anillin-depleted cell (see Supplementary Fig. 1 and Methods). 

g, Integrals of cortical myosin distributions (shaded regions in f), in control and 
anillin-depleted cells shortly before the onset of oscillations. Boxes extend from 
the 25th to 75th percentiles, with a line at the median. P values, Mann-Whitney 
U test (m = 10). a.u., arbitrary units. 
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MRILC oscillated in phase opposition with the size of the poles, indi- 
cating that cortex density increases as a pole contracts (Fig. 3a, b, 
Supplementary Fig. 8 and Supplementary Movies 8 and 9). We thus 
refined our model to include the effect of cortex turnover. With t the 
turnover time, cortex dynamics are described by d(Sc)/dt = S(cg — c)/t, 
where c is the cortex density, co the steady-state density and S the 
surface area of a pole. This equation implies that when perturbed, 
cortex density and the resulting tension relax to their reference levels 
on a timescale t (Supplementary Model section 3). Cytoplasm move- 
ments are described by 2adv;/dt = —AP where « isa friction coefficient 
limiting the variation of the normalized volume v; of a pole in response 
to a pressure difference AP between the poles (Fig. 1d). For intermedi- 
ate values of the stability parameter To/RoK and for slow enough actin 
turnover, the system described by these two equations exhibits spon- 
taneous shape oscillations (Fig. 3c, d and Supplementary Model). The 
underlying mechanism is as follows: if polar contraction is faster than 
cortex turnover, the cortex accumulates during contraction. As elasticity 
slows down contraction, cortex turnover leads to tension decrease at 
the contracting pole and increase at the opposing pole, and reversal of 
the oscillation direction. Cortex and pole size dynamics during oscilla- 
tions could be very accurately fitted to the driving equations of the 
model (Fig. 3e, Supplementary Fig. 9 and Model Section 4). These fits 
allowed us to extract mechanical parameters, including the stability 
parameter To/RoK from the data. We could thus position the oscillating 
cells in the state diagram and found that they were located in the 
linearly unstable oscillatory region (Fig. 3d and Supplementary Fig. 9). 
The period of the simulated oscillations was in good agreement with the 
period measured experimentally (Fig. 3c and Supplementary Fig. 8). 
Finally, the model predicts that the cell has a characteristic response 
time to pressure changes given by the ratio of friction to elasticity 
T, = «/2K. Consistently, we observed that cortex intensities reached a 
maximum and started decreasing before the reversal of the oscillation 
direction (Fig. 3b and Supplementary Fig. 8c, e). These observations 
show that the model (summarized in Fig. 4a) is in very good agreement 
with the experimental data. 

Our findings indicate that polar tension must be tightly controlled 
during cytokinesis, raising the question of how this control is achieved. 
One striking difference between oscillating and control cells is that 
while control cells display blebs at both poles throughout division 
(Supplementary Movie 1), in oscillating cells, blebbing seems to alternate 
between the poles (Supplementary Movies 8 and 9). As bleb growth 
releases cortical tension and the resulting intracellular pressure’®*”, it 
is likely that blebbing effectively lowers polar tension and thus limits 
polar contractions (Fig. 4a). We tested the effect of inducing blebbing by 
laser ablation in oscillating anillin-depleted HeLa cells. Bleb induction in 
a contracting pole stopped the expansion of the opposite pole, while 
cytoplasm flowed into the bleb (Fig. 4b and Supplementary Movie 10), 
indicating that bleb formation can interfere with polar contraction. We 
then globally inhibited blebbing during cell division by treating cells with 
wheat germ agglutinin (WGA) and concavalin A, tetravalent lectins that 
crosslink and stabilize the cortex from the outside”®. These treatments 
strongly decreased bleb formation during cytokinesis and triggered 
shape oscillations and division failure in 35% (WGA, n= 102) to 
45% (concavalin A, n = 130) of L929 cells (Fig. 4c and Supplemen- 
tary Movie 11), suggesting that hindering bleb formation favours 
shape instabilities. On the basis of these experiments, we propose that 
during cytokinesis, blebs act as pressure sinks that reduce contractile 
tension and the resulting intracellular pressure at the cell poles 
(Fig. 4a). Importantly, bleb size must be tightly controlled, as the 
formation of very large blebs can destabilize cell shape by introducing, 
rather than buffering, a tension imbalance between the poles (Fig. 2b, d 
and Supplementary Discussion). 

Notably, shape instabilities arise only for high polar tensions 
(Fig. 3d), and high tension also favours bleb formation. Cortical tension 
and bleb formation could thus constitute a self-regulating mechanical 
feedback loop. Bleb nucleation results either from local detachment of 
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Figure 3 | Dynamic characterization and model of shape oscillations. 

a, Cross-sectional areas of the two poles of an oscillating anillin-depleted L929 
cell and mean intensities of cortical actin and myosin, left and right of the 
cleavage furrow, as a function of time. b, Zoom on the regions highlighted in 
a for the left pole. a, b, Intensities are normalized to their average. Circles, data 
points; solid lines, data smoothed by factor 5. c, Simulation of the evolution of 
the difference of volumes v of the two poles (top) and of the polar actin densities 
(bottom: c,, cortex density; Ca» Mean cortex density) using parameters extracted 
from the cell in a (see Supplementary Model). d, Phase diagram of the linear 
stability analysis of a symmetric cell with respect to volume perturbation in the 
model coupling actin turnover and cell mechanics, in the parameter space (t,/t, 
To/RoK), with R./Ro = 0.8; To is the polar tension before oscillations. Cells are 
positioned on the diagram according to parameters extracted from fits as in 

e (magenta, anillin depleted cells; green, laser-ablated cells). Error bars, 
standard deviation of the fit parameters. e, Fit of the experimental parameters of 
the cell in a (blue points) to the driving equations of the model (red curves). 
Left, fit to the dynamic equation of v, yielding the elastic response of the cell f(v); 
right, fit to the equation for cortex density (see Supplementary Model). 


the membrane from the cortex or from local cortex tearing (Sup- 
plementary Fig. 10). Both mechanisms are favoured by high cortical 
tension’. It is thus possible that the cytokinetic cortex is tuned to form 
blebs, and thus release tension, as soon as polar tension exceeds the 
threshold for shape instabilities. Alternatively, cytokinetic bleb forma- 
tion could result from a mechanosensing signalling cascade. However, 
we found that calcium ions, common mediators of mechanosensory 
responses, which have been shown to be implicated in blebbing”, were 
not involved in cytokinetic bleb formation and shape oscillations 
(Supplementary Fig. 11). Other elements such as the spindle also con- 
tribute to maintaining the cleavage furrow at the equator”’’', and 
mechanical properties, such as cell elasticity, can also prevent shape 
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Figure 4 | Cell shape stability during cytokinesis. a, Top, in control cells, bleb 
formation constantly releases polar pressure and helps ensure the success of 
cytokinesis. Bottom, high polar tension or a large pressure imbalance between 
the poles can trigger shape oscillations. Model of the oscillations: if cortex 
contraction (resulting from tension T) is faster than cortex turnover (green 
arrows), cortex contractility, dynamics and cell elasticity result in cycles of 
cortex contraction, disassembly and subsequent reassembly. b, Laser ablation at 
the contracting pole of an oscillating anillin-depleted HeLa cell arrests the flow 
of cytoplasm between the poles and causes a premature reversal of the flow 
direction (15 cells out of 16). Top, DIC time-lapse; bottom, corresponding 
particle image velocimetry vectors and cell outline. Two consecutive ablations 
were performed (red circles); 0s, first frame after ablation. c, DIC image series 
of a cytokinetic L929 cell oscillating after WGA treatment. Scale bars, 10 pm. 


instabilities. Therefore, polar bleb formation may act asa partly redund- 
ant safety mechanism. Such redundancy is common in cytokinesis and is 
thought to provide robustness to the process”. Future studies will have 
to elucidate how exactly cellular mechanical properties, molecular 
pathways and bleb formation are integrated into a fine-tuned mech- 
anical system ensuring the success of cytokinesis. 

Our findings may have important implications for asymmetric cell 
division. A recent study in C. elegans QR.a neuroblasts’ shows that 
asymmetric divisions can be the direct result of myosin-driven cortex 
contraction of one of the poles at the expense of the other, a mechanism 
that could also be relevant for asymmetric division of Drosophila 
neuroblasts® (see Supplementary Discussion). An important open 
question is how such an asymmetric shape, with more cortical myosin 
and thus a higher tension in the smaller pole, can be stable. It will be 
interesting to investigate the mechanics of contractility-driven asym- 
metric divisions. 

Finally, the oscillation mechanism proposed here could be relevant 
for other systems where cortex oscillations are observed, such as during 
epithelial contractions in Drosophila development” (Supplementary 
Discussion). A common feature of all these oscillations is that they 
display cycles of cortex contraction, disassembly and reassembly, 
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suggesting they might be driven by a common mechanism coupling 
contractility, cortex turnover and cell elasticity. 


METHODS SUMMARY 


Culture conditions and cell treatments. Cell culture, drug treatments, gene 
silencing by RNA interference, knockdown quantification by quantitative PCR 
and cell transfections are detailed in the Supplementary Methods. 

Local cytochalasin D delivery. We conducted local delivery of cytochalasin D 
using two micropipettes mounted on a motorized micromanipulator (Eppendorf) 
connected to pressure regulators. Imaging was performed on a Carl Zeiss Axiovert 
200M inverted microscope. 

Local laser ablation. Laser ablation experiments were performed as described in 
ref. 18 on a scanning confocal microscope (Olympus FV1000) equipped with two 
scanning heads. For ablation, we used a 405-nm picosecond-pulsed laser with a 
nominal power of 3 mW for 2.5s. 

Particle image velocimetry and image processing. Cytoplasmic flows in dividing 
cells were determined with a custom-made particle image velocimetry algorithm 
in MATLAB (MathWorks). Images were processed, automatically analysed and 
visualized using custom code in MATLAB (MathWorks) (detailed in the 
Supplementary Methods). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell lines and culture. L929 and HeLa fibroblasts were grown in DMEM (GIBCO, 
Invitrogen) supplemented with 10% foetal bovine serum, 1% glutamine and 1% 
penicillin-streptomycin in polystyrene culture flasks (Nunc). For all the time- 
lapse experiments, the cells were plated on glass-bottom dishes (Mattek), main- 
tained at 37 °C and supplied with 5% CO, on the microscope stage. All cell lines are 
adherent and maintain attachments to the substrate throughout division 
(Supplementary Fig. 5 and Supplementary Movie 5). The stable HeLa-GFP-actin 
line was a gift from F. Buchholz. The stable Hela~EB3-GFP-Lifeact-mCherry line 
was a gift from the laboratory of M. Piel. 

Cell treatments, plasmids, RNAi and cell transfection. Rho-kinase was inhib- 
ited with Y27632 (Merck Bioscience, at least 30 min treatment at 101M). To 
stabilize the cortex and limit bleb formation, we used wheat germ agglutinin 
(Invitrogen) and concavalin A (Sigma Aldrich-Chemie) at 25 pg ml ! for up to 
2h as described in refs 26 and 31. Lifeact-GFP™ was a gift from R. Wedlich- 
Sdldner. MRLC-tdRFP wasa gift from G. Charras. mCherry-CAAX was generated 
by replacing the GFP cassette with an mCherry cassette in a CAAX-GFP plasmid 
(gift from G. Charras), by cutting with BamH1 and EcoRI (performed by M. 
Bergert). For experiments requiring increased substrate adhesiveness, glass- 
bottom dishes were coated with 15 1g ml‘ collagen type I from rat tail (Sigma) 
and fibronectin 0.1% solution (diluted 1:1,000 in H,O) from bovine plasma 
(Sigma). Mouse stealth anillin short interfering RNA (siRNA) (5’-CCGUUGGA 
AGGUCACAUCUGUUUAA), stealth Gelsolin (5'-ACGGGUGAUGCCUAU 
GUCAUCCUAA) and scrambled control were obtained from Invitrogen. 
Mouse cofilin siRNA (5'-GAAUCAAGCAUGAAUUACATT) was obtained from 
Ambion. Human anillin siRNA (5'-GAUGCUUGCUACAAACCUATT) was 
obtained from Qiagen. For L929 cells, transfection of 2 1g per well plasmid 
DNA or 4pmol per well siRNA was achieved by electroporation using the Cell 
Line 96-well Nucleofector Kit SE (Lonza) according to the instructions supplied 
(program code EH-100). For siRNA transfection in HeLa cells, Lipofectamine 
RNAiMax (Invitrogen) was used. The final concentration of siRNA was 10 nM. 
For co-transfection of plasmid and siRNA in HeLa cells, Lipofectamine 2000 
(Invitrogen) was used. Final concentrations were 10nM for siRNA and 
0.2 pg ml! for plasmid. Cells were used 30h after transfection in laser ablation 
and anillin depletion experiments. Intracellular Ca”* was chelated by an incuba- 
tion of no less than 15 min in 40 1M BAPTA-AM (Invitrogen). Extracellular Ca** 
was removed by performing the experiments in Ca’* -free normal Tyrode’s solu- 
tion. Additional chelation with 500 1M EGTA (Sigma) was tested but had no 
further effect. Microtubules were depolymerized with 5 1M nocodazole (Sigma). 
RNA isolation and quantitative PCR. The extent of anillin depletion in L929 and 
HeLa cells (Supplementary Fig. 14) was estimated by performing qPCR with 
reverse transcription and comparing the levels of messenger RNA (mRNA) to 
endogenous levels of mBeta-2- microglobulin (L929) or GAPDH (HeLa) mRNA. 
Total RNA was isolated using the RNeasy kit (Qiagen) according to the manu- 
facturer’s instructions. RNA was cleaned of genomic DNA by DNase digestion 
with RNase-Free DNase Set digest kit (Qiagen). Complementary DNA was sub- 
sequently synthesized from 0.5 ug of RNA using SuperScript III RNase H-Reverse 
Transcriptase kit (Invitrogen). Comparative qPCR was performed using the 
Absolute qPCR SYBR Green mix (Thermo Fisher Scientific) and an Mx3000P 
cycler (Stratagene). Primers were as follows: L929 Anillin, forward 5’-AAACT 
CGAATGCAAAGGCTTG, reverse 5'’-AATGGGTGGCTTAGAAGGTG; L929 
mBeta-2-microglobulin, forward 5’-ATTCACCCCCACTGAGACTG, reverse 
5'-TGCTATTTCTTTCTGCGTGC; Hela Anillin, forward 5’-GACAAATCT 
ACGACACCAGGAG, reverse 5’-CTTGGATGGCCTTTGTATTTGG; HeLa 
GAPDH, forward 5'-GAGTCAACGGATTTGGTCGT, reverse 5’-TGGAAGA 
TGGTGATGGGATT. 

Local cytochalasin D delivery. Local delivery of cytochalasin D** was performed 
on a Carl Zeiss Axiovert 200M inverted microscope equipped with motorized 
micromanipulator arms (Eppendorf TransferMan NK2) and imaged through a 
Carl Zeiss Neofluar X63 (numerical aperture = 0.75) objective. Micropipettes 
were pulled with a flaming pipette-puller (P-97, Sutter Instruments), and forged 
with a home-built forge. The releasing micropipette was loaded with 20 ul of 
5gml ' cytochalasin D (Sigma) diluted in culture medium additioned with 
Alexa Fluor 568 Succinimidyl Ester diluted to a final concentration of 600 nM 
(Invitrogen), and mounted together with the aspiration micropipette on a double 
pipette holder (Eppendorf TwinTip-Holder). The aspiration pipette was posi- 
tioned slightly behind the releasing pipette (Fig. 2a). Both micropipettes were 
connected to pressure regulators: a FemtoJet Microinjector (Eppendorf) for the 
releasing pipette and a pump for the aspiration pipette (Eppendorf CellTram Air). 
The flowfield was monitored by tracking fluorescent Alexa Fluor 568. The pipettes 
were positioned so that only one pole of a dividing cell was exposed to the released 
drug. The release of cytochalasin D was maintained only until local bleb formation 


was observed (about 30 s). For the dimethylsulphoxide control, cytochalasin D was 
replaced by dimethylsulphoxide at 1:1,000 dilution in the releasing pipette. 
Laser ablation and cell imaging. Laser ablation experiments were performed as 
described in ref. 18 on a scanning confocal microscope (Olympus FV1000) 
equipped with two scanning heads. The first was used for standard imaging; the 
second was coupled to a laser head (LDH-P-C-405B, PicoQuant), driven by a 
power source (PDL 800-B, PicoQuant) delivering 405-nm picosecond pulses with 
a nominal power of 3 mW. For ablation, the pulsed laser underwent spiralling 
movements within a circle of 800-nm nominal diameter for 2.5 s. Unless otherwise 
specified, cells were imaged transversally through a single central confocal Z plane, 
midway between the substrate and the top of the cell, through an Olympus 
UPlanSApo X60 oil immersion objective (numerical aperture = 1.35). Imaging 
for volume measurements was performed on an Andor IQ spinning disc confocal 
microscope by acquiring Z stacks through the entire volume of cells, through an 
Olympus UPlanSApo X60 water immersion objective (numerical aperture = 1.2) 
and X1.6 Optovar. Long-term time-lapse microscopy was performed on an 
Olympus-IX81 widefield microscope through an Olympus UPlanSApo X20 air 
objective (numerical aperture = 0.5). 

Image processing and automated image analysis. Images were processed, auto- 
matically analysed and visualized using custom code in MATLAB (MathWorks). 
Particle image velocimetry’ was performed on DIC images yielding local mea- 
sures of the cytoplasmic flows. Particle image velocimetry is an image-processing- 
based technique for measuring fluid displacement by cross-correlation between 
images from consecutive time points. For illustration purposes, the flow vectors 
were colour-coded according to their angles with respect to the cleavage furrow. 
Vectors with rightward direction were coloured green, and vectors with leftward 
direction were coloured red. The mean vector component along the pole—pole axis 
was calculated for each time-point then smoothed with a moving average of win- 
dow size 5. The angular distribution and amplitudes of the mean vectors per time- 
point formed the basis of the classification of cells as either displaying cytoplasmic 
flows or not (Supplementary Fig. 2). Flows in cytokinetic cells were considered 
significant when their amplitude was more than two times higher than the ampli- 
tude of flows typically observed in metaphase cells. 

Cell contour detection and definition of cellular regions: segmentation and 
tracking of cells was based on a combination of image filtering, ellipse fitting, 
adaptive thresholding and edge detection operations on raw fluorescent intensities 
from one of the fluorescent channels. The cortex was defined as a region of fixed 
depth underlying the segmented cell contour. The cytoplasmic region was taken as 
the difference between the segmented cell area and the cortical region. The cleavage 
furrow was defined by a straight line between two points, the positions of which 
were determined automatically along the cell contour. The end points were reposi- 
tioned manually when the algorithm failed to position the furrow line correctly. 
The furrow line delimited the left side of the cell from the right side. The definition 
of the cell contour, cortical, cytoplasmic and left/right delimitations allowed for the 
extraction of region-specific morphological parameters, intensity values and oscil- 
lation periods (Supplementary Fig. 12). 

Cortex intensity measurements: mean fluorescence intensities [ean Of F-actin 
(Lifeact) and myosin (MRLC) in the cortex were determined in the defined cortical 
region (described above). As the depth of this region was kept constant while the 
apparent thickness of the cortex varied both along the cell contour and within a 
time-lapse, the cortical region was chosen so as to overestimate the actual cortex. 
The region therefore included pixels belonging both to a varying depth of cortex 
and to the cytoplasm. The mean intensities measured in this region thus constitute 
a combined measure of both the average cortical intensity per unit area and the 
actual thickness of the cortex. Cortical intensity profiles (Fig. 2f and Supplemen- 
tary Figs 1, 6 and 7) depict the mean cortical intensity along the cell contour 
Imean(S), Where s is the position along the cell contour. The profiles start at one 
intersection of the furrow line with the cell contour and then follow the cell 
contour in the cortex region in a clockwise direction. To allow for comparison 
of cells with different contour lengths, the path length was set to a constant length 
of 1,000 points and intensity values were interpolated by a factor of 1,000 per actual 
path length. The intensity values were then normalized to the global cortex mean 
intensity (Imean(S)/<Jmean>), 80 that values greater (respectively smaller) than 1 
indicate that the local cortex intensity is larger (respectively smaller) than the 
global cortex mean. To measure the deviation from a perfectly flat intensity profile, 
which would indicate a perfectly uniform distribution of intensities in the cortex, we 
integrated the deviations of the intensity profiles of cortical proteins, normalized as 
described above, from 1 (|[Imean(s)/ <Imean> — 1|ds). 

Volume measurements. Volume measurements during cytokinesis were performed 
on dividing L929 cells (Supplementary Fig. 13). Oscillations were induced by anillin 
depletion, as described above. Cells were loaded with an intracellular fluorescent dye 
(2.5uM CMFDA (5-chloromethylfluorescein diacetate), Invitrogen) excitable at 
488 nm and emitting at approximately 520 nm for 30 min at 37 °C and 5% CO; 
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and then changed into conditioned medium for 1h before imaging. 5 uM Alexa 
Fluor 568 Succinimidyl Ester (Invitrogen) was then added to the medium as an 
extracellular dye excitable at 561nm and emitting at approximately 590 nm. 
Imaging and data acquisition were performed on an Andor IQ spinning disc 
confocal microscope. Time-lapses were bleach corrected using a custom plugin, 
the Alexa Fluor 568 channel inverted and multiplied with the CMFDA channel 
using Fiji Is Just ImageJ. Surface rendering and volume calculations were then 
performed in Imaris X64 (Bitplane). We estimate an error in volume measure- 
ments no greater than 10.65%, based on an allowed error of 1 voxel diagonal in 
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determining the cell boundary at every point of a spherical cell of 3,000 pm? 
(imaging voxels were on average 181 nm X 181 nm X 580 nm). 
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A role for cohesin in T-cell-receptor rearrangement 
and thymocyte differentiation 


Vlad C. Seitan!*3*, Bingtao Hao**, Kikué Tachibana-Konwalski’*, Thais Lavagnolli’’, Hegias Mira-Bontenbal’”, 
Karen E. Brown!®, Grace Teng®, Tom Carroll’, Anna Terry”, Katie Horan’, Hendrik Marks*, David J. Adams’, David G. Schatz®"°, 
Luis Aragon”?, Amanda G. Fisher’?, Michael S. Krangel*, Kim Nasmyth® & Matthias Merkenschlager!? 


Cohesin enables post-replicative DNA repair and chromosome 
segregation by holding sister chromatids together from the time 
of DNA replication in S phase until mitosis’. There is growing 
evidence that cohesin also forms long-range chromosomal cis- 
interactions’ * and may regulate gene expression” "° in association 
with CTCF*”, mediator* or tissue-specific transcription factors’. 
Human cohesinopathies such as Cornelia de Lange syndrome are 
thought to result from impaired non-canonical cohesin functions’, 
but a clear distinction between the cell-division-related and cell- 
division-independent functions of cohesion—as exemplified in 
Drosophila’ ’—has not been demonstrated in vertebrate systems. 
To address this, here we deleted the cohesin locus Rad21 in mouse 
thymocytes at a time in development when these cells stop cycling 
and rearrange their T-cell receptor (TCR) a locus (Tcra). Rad21- 
deficient thymocytes had a normal lifespan and retained the ability 
to differentiate, albeit with reduced efficiency. Loss of Rad21 led to 
defective chromatin architecture at the Tcra locus, where cohesion- 
binding sites flank the TEA promoter and the Ea enhancer, and 
demarcate Tcra from interspersed Tcrd elements and neighbour- 
ing housekeeping genes. Cohesin was required for long-range 
promoter-enhancer interactions, Tcra transcription, H3K4me3 
histone modifications that recruit the recombination machinery'** 
and Tcra rearrangement. Provision of pre-rearranged TCR trans- 
genes largely rescued thymocyte differentiation, demonstrating 
that among thousands of potential target genes across the gen- 
ome**"°, defective Tcra rearrangement was limiting for the differ- 
entiation of cohesin-deficient thymocytes. These findings firmly 
establish a cell-division-independent role for cohesin in Tcra locus 
rearrangement and provide a comprehensive account of the 
mechanisms by which cohesin enables cellular differentiation in a 
well-characterized mammalian system. 

The somatic rearrangement of lymphocyte receptor loci is central to 
adaptive immunity’®. Gene segments distributed over millions of base 
pairs of genomic DNA are transcribed, brought into proximity with 
each other, and recombined in a cell-lineage- and developmental- 
stage-specific fashion’””’. In developing thymocytes, proliferation 
and differentiation are tightly linked and the activity of Rag (recom- 
bination activating gene) proteins is restricted to the G1 phase of the 
cell cycle”. Early thymocytes at CD4~ CD8™ double-negative (DN) 
stages 1 and 2 proliferate in response to cytokines and briefly arrest at 
the DN3 stage, where they rearrange the TCR locus (Fig. 1a). Pre- 
TCR signals drive a phase of proliferation that extends to the early 
CD4* CD8* double-positive (DP) stage. Shortly after the acquisition 
of CD4 and CD8, DP thymocytes lose the expression of the transferrin 
receptor CD71 (ref. 21) and become small, non-proliferating CD71 


DP cells (Fig. 1a), which represent the great majority of thymocytes. 
During their life span of 3 to 4 days, DP thymocytes undergo several 
rounds of Tcra rearrangement'®’*”’. Successful TCR expression and 
engagement selects a minority (3-5%) of DP thymocytes for differ- 
entiation via a CD4* CD8'° intermediate stage towards long-lived 
CD4 or CD8 single-positive (SP) cells, again with minimal prolifera- 
tion’®” (Fig. 1a). 

To address the role of cohesin in Tcra rearrangement and thymo- 
cyte differentiation, we combined a conditional allele encoding the 
cohesin subunit Rad21 (Rad21'°*; Fig. 1b and Supplementary Fig. 
la) with a Cre transgene under the control of Cd4 regulatory elements 
(CD4-Cre), which becomes active at the transition from the CD4— 
CD8 DN to the DP stage”’. Pilot experiments with yellow fluorescent 
protein (YFP) reporters showed CD4-Cre-dependent accumulation of 
YFP after the CD71” proliferative stage in non-dividing DP thymo- 
cytes (Supplementary Fig. 1b). Proliferating CD4-Cre Rad21'°*"'™* 
CD71~ DP cells showed partial locus deletion but retained >50% 
Rad21 messenger RNA and protein (Fig. 1c). Rad21 genomic deletion 
was essentially complete (>97%) and Rad21 RNA and protein levels 
were substantially reduced in non-dividing DP thymocytes (Fig. 1c). 
Hence, cohesin was selectively depleted from non-dividing thymo- 
cytes. Importantly, CD4-Cre Rad21'°*"* DP thymocyte numbers were 
normal (Fig. 1d). Intermediate CD4* CDs" and mature CD4 SP and 
CD8 SP thymocytes accumulated slowly in CD4-Cre Rad21'°*"™* mice 
(Supplementary Fig. 2a) but were present in normal numbers by 6 
weeks of age (Fig. 1d). 

Bromodeoxyuridine (BrdU) incorporation into replicating DNA can 
identify proliferating thymocyte populations and track their differenti- 
ation”, Pulse-chase experiments labelled proliferating (CD71~) DP but 
not non-proliferating CD4* CDs8'°, CD4 SP or CD8 SP thymocytes”? 
(Fig. le, 2h time point). During the subsequent chase period, BrdU- 
labelled DP cells differentiated to become CD4* CD8"° and eventually 
CD4 or CD8 SP cells (Fig. Le, left). This sequence of differentiation was 
preserved in CD4-Cre Rad21'*"* thymocytes, but the proportion of 
DP thymocytes that became CD4 SP or CD8 SP was reduced (Fig. le, 
right, and Supplementary Fig. 2b). 

In continuous BrdU labelling experiments, the percentage of BrdU ~ 
cells indicates population turnover”. Importantly, cohesin-deficient 
and control DP thymocytes labelled with similar kinetics (Fig. 1f). 
Consistent with the pulse-labelling data (Fig. le), the accumulation 
of CD4* CDs8'° and CD4 SP subsets was reduced in CD4-Cre 
Rad2]'*"™ mice (Supplementary Fig. 2c). Hence, cohesin depletion 
impaired the differentiation of DP thymocytes, but not their survival. 

Unlike many other differentiated cell types, mature thymocytes can 
be induced to re-enter the cell cycle. In vitro activated CD4-Cre 
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Figure 1 | Genetic cohesin depletion in non-dividing thymocytes. 

a, Thymocyte differentiation from left to right: CD47 CD8 DN stages 1 to 4; 
CD4* CD8* DP, CD4* CD8"; CD4 or CD8 SP cells. Proliferation is in green, 
cell cycle arrest in red. Histograms show DNA content. b, Conditional Rad21 
allele (see Supplementary Fig. 1a). c, Real-time genomic PCR of Rad21 locus 
deletion, RT-PCR of Rad21 RNA and western blotting of Rad21 protein. d, Cell 
numbers and flow cytometric analysis of thymocyte subsets in 6-week-old 
CD4-Cre Rad21!*'™ and CD4-Cre Rad21' mice (mean + standard 


Rad21'**"** CD4 SP thymocytes showed abnormal mitotic figures with 
multiple spindles, chromosome segregation defects (Fig. 1g and 
Supplementary Fig. 3) and poor survival (Fig. 1g). CD4—Cre-mediated 
deletion of Rad21 therefore generates thymocytes that die when forced 
to divide, yet have a normal lifespan as non-dividing cells in vivo. This 
allows the interrogation of cohesin functions in interphase, independ- 
ent of essential cohesin functions during cell division. 

Rad21 chromatin immunoprecipitation and sequencing (ChIP- 
seq) mapped cohesin to key positions within the Tcra locus in DP 
thymocytes (Fig. 2a). Cohesin was abundant at the locus control 
region”, which separates the Tcra enhancer Ex from the neighbouring 
Dad1 housekeeping gene**”®. Other prominent cohesin sites separated 
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0 24 
Continuous BrdU labelling 


48 72h 


9 CD4-Cre Rad27!t CD4-Cre Rad21!%/!ox 


0.05 x 108 


CD4 


CD8 CD8 


deviation (s.d.), n = 12). e, Pulse chase analysis of CD4-Cre Rad2'™ and 
CD4-Cre Rad21'“'™ thymocytes. Dot blots are gated on BrdU* cells fe 
Supplementary Fig. 2b). f, Continuous BrdU labelling for DP thymocyte 
turnover. See Supplementary Fig. 2c for CD4* CD8"° and CD4 SP subsets 
(mean + s.d., n = 3-5 per data point). g, Top, metaphase spreads of 2 day 
activated thymocytes stained for «-tubulin (green) and DNA (DAPI, blue; see 
Supplementary Fig. 3). Magnification, X2,500. Bottom, cells recovered after 
5 days. 


the Tcra TEA promoter from the Tcrd enhancer E86, which controls 
Tcrd gene segments that are interspersed within the Tcra locus but 
follow a distinct developmental stage-specific program'®’. Cohesin 
colocalization with the insulator protein CTCF*” is found at the Dad1 
site 10kb downstream of Ex, whereas at the Ju49 promoter cohesin 
associates with its loading protein Nipbl and mediator subunits* more 
than with CTCF (Supplementary Fig. 4). Interestingly, the major Tcra 
regulatory elements Ex and TEA bound copious amounts of cohesin, 
Nipbl and mediator as well as CTCF (Supplementary Fig. 4). 

RNA sequencing (RNA-seq) indicated that Tcra constant region 
(Ca) transcripts were considerably more abundant than transcripts 
from the neighbouring Dad1 gene and the Tcrd constant region (Cd) in 
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Figure 2 | Cohesin affects Tcra transcription and Rag recombinase 
recruitment. a, Rad21 ChIP-seq of the 3’ part of the Tcra locus in DP 
thymocytes. Arrowheads highlight cohesin sites at the Ex enhancer (black), the 
Tcra locus control region (grey), Ju promoters (turquoise), the TEA promoter 
(green) and between Tcrd elements and V gene segments (blue). b, RNA-seq of 
Tcra in CD4-Cre Rad21!°*"'°* (red) and control Rad21'* (black) DP 
thymocytes. Rad21 ChIP-seq is in blue. c, ChIP of H3K4me3 relative to total H3 
in CD4-Cre Rad21'°*™* and control DP thymocytes. Hbb is a negative and Actg 
and Elp4 are positive control loci (mean = s.e. of two independent 
experiments). P = 0.016 for all Jax elements; P = 0.38 (not significant) for 
proximal (Jx61-48) and P = 0.004 for distal (Ja37-16) Ja elements. d, ChIP of 
Rag? relative to total H3 as inc (mean = s.e. of three independent experiments). 
P= 0.0001 for all Ju elements; P = 0.054 (not significant) for proximal (Jo61- 
48) and P = 0.0001 for distal (Ja37-16) Jo elements. e, ChIP of Rag] relative to 
total H3 as in c (mean + s.e. of two independent experiments). P = 0.005 for all 
Jo elements; P = 0.17 (not significant) for proximal (J%61-48) and P = 0.003 for 
distal (Ju37-16) Jox elements. 


control DP thymocytes. In cohesin-depleted small DP thymocytes, Cd 
and Dad1 transcripts were elevated at the expense of Co transcripts 
(Fig. 2b) as confirmed by real-time polymerase chain reaction with 
reverse transcription (RT-PCR) (Supplementary Fig. 5a). Moreover, 
transcription across the Tcra joining elements, Ju, was skewed: control 
DP thymocytes preferentially transcribed distal (3’) Jo elements, 
whereas cohesin-depleted DP thymocytes preferentially transcribed 
proximal (5") Ja elements (Fig. 2b). 

Transcription of lymphocyte receptor loci facilitates rearrange- 
ment” in part via the trimethylation of histone H3 at lysine 4 
(H3K4me3). H3K4me3 recruits Rag2 protein'*"*’*, which together 
with Ragl forms the recombinase complex’*. ChIP showed reduced 
H3K4me3 deposition (Fig. 2c) and Rag binding (Fig. 2d, e) at distal Ja 
elements in CD4-Cre Rad21'°“' DP thymocytes. Hence, cohesin 
deficiency affected Tcra transcription, H3K4me3 histone modifica- 
tions and the recruitment of Rag recombinases. 

Primary Tcra rearrangements involve proximal (5) Jo elements and 
occur in early CD71* DP thymocytes, whereas secondary rearrange- 
ments involve progressively more distal (3’) Ja elements in non- 
dividing DP thymocytes"® (Supplementary Fig. 1b). CD71* DP 
CD4-Cre Rad21'*'* thymocytes had near-normal Rad21 protein 
levels (Fig. 1c) and Cd, Cx and Dad1 transcription (Supplementary 
Fig. 5b), and primary rearrangements of proximal Jo elements were 
present at normal levels (Fig. 3a, b). In contrast, secondary Tcra re- 
arrangements were substantially impaired in non-dividing CD4-Cre 
Rad21'*"** DP thymocytes (Fig. 3a, b), which were depleted of 
cohesin (Fig. 1c). The usage of the distal Jx22 element, for example, 
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Figure 3 | Cohesin affects Tcra rearrangement. a, Threefold dilutions of 
genomic Va8-Ja PCR products from CD71* or CD71” DP thymocytes 
visualized with ethidium bromide. Cd14 is a genomic control. b, Genomic PCR 
products from small DP thymocytes visualized by Southern blotting with Jo- 
specific probes (left). Usage of the distal Ju22 element was reduced by 86% 


Jos8 53 42 22 2 


(middle; mean = s.e., n = 3). Southern blotting of Vo8-Jo. RT-PCR products 
from CD4-Cre Rad21'**"** normalized to control DP thymocytes (right; 
mean + s.e., 1 = 3). ¢, Double-strand breaks in threefold serially diluted 
genomic DNA from CD4-Cre Rad21'°*"™ and control DP thymocytes detected 
by ligation-mediated PCR. 
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Figure 4 | Cohesin mediates long-range interactions between regulatory 
elements that control Tcra transcription. a, Transcript copy number of the 
unrearranged Tera J region in Rag!-deficient CD4-Cre Rad21'“"™ and 
Rad21'°*' DP thymocytes (mean + s.e., n = 3). b, 3C analysis of long-range 
interactions between Ex and Tera restriction fragments (shaded) in Rag1- 
deficient CD4-Cre Rad21'™ (black; mean + s.d., n = 3), CD4-Cre 
Rad21'**"'* (red; mean + s.d., n = 3) DP thymocytes and pre-B cells (grey; 
mean ~ s.d., n = 3). Intervening HindIII fragment numbers and genomic 
distances are indicated. Asterisks indicate P< 0.05 (grey, control thymocytes 
versus pre-B cells; red, control versus cohesin-depleted thymocytes). NS, not 
significant. c, TCR transgenes rescue the differentiation of cohesin-depleted 
thymocytes. Top, percentages of BrdU“ cells in CD4* CD8"°, CD4 SP and CD8 
SP thymocytes. The differentiation of cohesin-deficient thymocytes is rescued 
by MHC class II-restricted (middle) and MHC class I-restricted TCR 
transgenes (bottom). N/A, not applicable. n = 3-5 per data point + s.d. 
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was reduced on average by 86% (Fig. 3b, middle), reflecting a progres- 
sive underrepresentation of 3’ Ja segments (Fig. 3b, right). This was 
confirmed by analysis of additional Va gene families (Supplementary 
Fig. 6a) and by RT-PCR-based copy number analysis of mature Tcra 
transcripts (Supplementary Fig. 6b). Therefore, primary rearrange- 
ments occurred before the depletion of cohesin, whereas reduced 
cohesin expression impaired secondary rearrangements in non- 
dividing DP thymocytes. Normal BrdU labelling kinetics (Fig. 1f) 
exclude decreased lifespan as an explanation for aberrant Tcra 
rearrangement”. Consistent with the defective recruitment of Rag 
proteins to the Tcra locus (Fig. 2d, e), double-strand breaks were 
reduced in cohesin-deficient thymocytes (Fig. 3c). This identifies 
Rag cleavage rather than double-strand break repair as the limiting 
step for Tcra rearrangements in cohesin-deficient thymocytes. 

Because Tcra rearrangement changes the positioning of regulatory 
elements’*, altered Tcra transcription (Fig. 2b) could either be a direct 
consequence of cohesin depletion, or result indirectly from defective 
rearrangement. To distinguish between these possibilities we compared 
intronic Tcra transcript copy numbers in the absence of rearrangement 
in Ragl-deficient control and CD4-Cre Rad21'“* DP thymocytes. 
Cohesin depletion reduced the transcription (Fig. 4a) and H3K4me3 
methylation (Supplementary Fig. 7) at Jo independently of Tcra 
rearrangement. 

To explore how cohesin affects Tcra transcription, we analysed 
long-range interactions between the TEA promoter and Ea, which 
are separated by approximately 80 kb of genomic DNA and together 
regulate the transcription of Tcra. In chromosome conformation cap- 
ture (3C) assays*’, Eo interacted strongly with TEA in DP thymocytes 
(Fig. 4b) but cohesin depletion reduced these interactions to the level 
found in pre-B cells, where Tcra is not detectably transcribed (Fig. 4b). 
Hence, the extent of Tcra enhancer-promoter interactions was cell- 
type specific, correlated with Tcra transcription and was cohesion 
dependent. A role for cohesin in additional enhancer—promoter inter- 
actions during sequential Tcra rearrangements is suggested by Ex 
contacts with promoters between Ja49 and Ja37 (Fig. 4b), which can 
drive Jo: transcription in the absence of TEA“, and by cohesin binding 
to numerous Va promoters (Fig. 2a). Cohesin depletion also affected 
Eo interactions with the neighbouring Dad1 cohesin site, and therefore 
the topology of the Tcra locus control region”, which has CTCF- 
dependent transcriptional insulator function**”® (Fig. 4b). As cohesin 
mediates CTCF-dependent transcriptional insulation®*’, increased 
Dad1 expression at the expense of Cu (Fig. 2b) may indicate impaired 
insulator function. 

To test whether aberrant Tcra rearrangement caused inefficient dif- 
ferentiation, we equipped cohesin-deficient thymocytes with transgenes 
encoding rearranged TCRs. Compared to endogenously rearranged 
TCRs (Fig. 4c, top), the expression of MHC class II- (Fig. 4c, middle) 
or MHC class I-restricted TCRs (Fig. 4c, bottom) markedly improved 
the generation of CD4-Cre Rad21'*“"* CD4* CD8" and SP thymocytes. 

In summary, cohesin shapes the chromatin architecture of the Tcra 
locus by mediating cell-type-specific long-range interactions between 
enhancer and promoter elements that control transcription, H3K4me3 
deposition, Rag recombinase recruitment, and ultimately Tcra 
rearrangement. These defects compromise thymocyte differentiation 
by limiting the number and diversity of sequential Tcra rearrange- 
ments. Hence, cohesin contributes to cellular differentiation in a 
well-characterized mammalian system. 


METHODS SUMMARY 


The conditional Rad21 allele was generated by inserting loxP sites into introns 4 
and 6 (Supplementary Fig. 1a). Methods used for RT-PCR and genomic PCR’, 
flow cytometry’, 3C analysis’, ChIP for cohesin’, histone modifications and 
Rag proteins” have been described. See Methods for other mouse strains, 
BrdU labelling and detection, ChIP-seq and RNA-seq protocols, ligation- 
mediated PCR, confocal microscopy, Tcra rearrangement assays and copy number 
measurements. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mice. Ragl-deficient mice*', OT-I as well as AND TCR transgenic mice**** have 
been described. To generate a conditional Rad21 allele, loxP sites were inserted into 
introns 4 and 6 to allow Cre-mediated excision of exons 5 and 6. Correctly targeted 
clones were detected by Southern blotting of Bmtl and Pacl digested genomic 
DNA to confirm the presence of 5’ and 3’ loxP sites, respectively. A puroAtk 
cassette flanked by FRT sites was excised to generate the conditional allele 
(Supplementary Fig. 1a). For genomic deletion analysis DNA was extracted and 
50 ng were used per real-time PCR reaction. Primers were designed to detect the 
intact and not the deleted Rad21'™ allele. Data were normalized to the geometric 
mean of two genomic DNA (gDNA) sites on chromosome 2 and 3, respectively. 

Rad21'™, forward, CATGGTTGGCAGATGAGCAAC; Rad21'*, reverse, 
CTACTTTCCCGCTAGCAACTG; gDNA chr 2, forward, CCCGGGCTAAT 
TCTCCTATGTC; gDNA chr 2, reverse, GCTGTAAAGTCAGTCGCTCGTG; 
gDNA chr 3, forward, TGTGCCAGCATCTTTTGCC; gDNA chr 3, reverse, 
GCGTGCTGGAATTAAAAGCC, 

Mice were on a mixed C57BL/6 and 129 background and were matched for age 
and Tcra haplotypes in all experiments involving Tcra expression or rearrange- 
ment. Tcra haplotypes were determined by genomic PCR. 

Primer 1, GAGGAAAAATGGCCCGGTAG; primer R, TTCAGTAGTCCCT 
CTCCACG. PCR amplification 94 °C for 5 min, 5 cycles of 94 °C for 30 s, 68 °C for 
30s, 72°C for 1 min, 5 cycles of 94°C for 30s, 65°C for 30s, 72°C for 1 min, 5 
cycles of 94°C for 30s, 60 °C for 30s, 72 °C for 1 min, 20 cycles of 94°C for 30s, 
58 °C for 30 s, 72 °C for 1 min, 10 min extension at 72 °C, HindIII digestion yields a 
300 bp band for 129 and a 150 bp band for C57BL/6. 

Rag!-deficient thymocytes were induced to progress to the CD4* CD8* DP 
stage by injecting 4-6 week-old Rag1-deficient mice once with 100 tg anti-CD3 
(145-2C11) intraperitoneally. Thymocytes were harvested 10 days later. For BrdU 
labelling experiments, mice were given a single intraperitoneal injection of 2 mg 
BrdU (pulse-chase) or 0.6 mg BrdU every 12 h (continuous labelling). Mouse work 
was done under project licence PPL70/6845 issued by the Home Office, UK. 

Ju segment usage was assessed by RT-PCR using Va8 and Cx primers, follow- 
ing which Southern blots of PCR products were hybridized with radiolabelled Ju- 
specific and Cu-specific probes”. 

Ligation-mediated PCR. This was done as described” using Jo.-specific primers 
and probes (5’ to 3’). Ja61, primer (internal), AGTCCCTTCTCAGGCAAGAT 
GG; probe, TGAGGAACACGGAGTATCTC. Jo58, primer, ATGGCTTTGGAC 
CATGGATG; probe, TCTGAAACTCGCACAGTGGA. Ja49, primer, AGGGAA 
AGTGACAACCAGGC; probe, AGGTGTAGATTCTAGCTCTG; Ja27, primer, 
ATGGCAGATAGAATGGAGCGG; probe, TACCTCCACCTGTCTTCTCA. 
Ja6, primer, ATCAGACCAGACTGTCTGCCC; probe, GACCAATGGCAAAG 
GGAGGT. 

Confocal analysis. Mitotic figures were stained for «-tubulin (DM1A, Sigma) and 
DAPI as described”. 

Real-time PCR. Real-time PCR was carried out in a 20 pil reaction volume using 
iQ SYBR Green Supermix (Bio-Rad) and 0.5 tM primers. Amplification was 95 °C 
for 3 min, then 40 cycles of 94 °C for 15 s, 60 °C for 30s and 72 °C for 30 s followed 
by plate read. 

RT-PCR and transcript copy number analysis. Total RNA was extracted using 
RNAbee (Tel-Test) and 1 ug was treated with 11 Turbo DNase (Ambion). 
Following DNase inactivation the RNA was reverse transcribed using 200 U 
Superscript III (Invitrogen) and 0.5 1g oligo(dT) or 3 ug random primers in a 
20 ul reaction volume. 1/80 was used per real-time PCR reaction. 

To estimate the number of transcripts, standard curves were generated for each 
pair of primers. The PCR amplicons generated by individual primer pairs were 
separated by agarose gel electrophoresis, gel purified and their concentration was 
determined using Quant-iT Picogreen (Invitrogen). The molar weight of the 
amplicons was calculated based on their sequence using OligoCalc’’, which 
allowed the enumeration of copies in each sample. Twofold serial dilutions from 
2,500,000 to 1,220 copies (spliced transcripts) and 2,000,000 to 976 copies (intro- 
nic transcripts) were analysed by real-time PCR. The standard curves were com- 
piled by plotting the real-time PCR C, values against seeded copy numbers and 
subsequently fitting a logarithmic function. Only primers that generated a log fit 
R? > 0.99 were used. These standard curves were subsequently used to extrapolate 
the number of transcripts in the biological samples based on the C, values obtained 
after real-time RT-PCR. For the analysis of spliced transcripts the forward primer 
was specific to the respective Ja segment and the reverse primer to Ca. 

eJu56, forward, CTGGAGGCAATAATAAGCTGACT; eJa52, forward, 
AACACTGGA AAGCTCACGTTTGG; eJ«:44, forward, GGCAGTGGTGGAAA 
ACTCACT; eJo38, forward, AGCTGATTTGGGGCTTGGGGA,; eJo31, forward, 
GGACGCAGCTGGTGGTGAAGC; eJa23, forward, GCTTATCTTTGGACAG 
GGAACCA,; eJo118, forward, AACTTTTGGAATCGGAACCAGGG; eJa9, forward, 


ACAAACTTACCTTCGGGACAGG; TCRa-C, reverse, TCCATAGCTTTCATGT 
CCAGC. 

Germline transcripts were analysed using primer pairs specific to introns adja- 
cent to the Ja segments included in the spliced transcripts analysis. No amplifica- 
tion was detected without reverse transcriptase. 

iJu56, forward, CCCTTGGAACCCTGATATGCG; iJa.56, reverse, CAGCCATT 
GTTTGGATTGGA; iJu52, forward, CTGTGGGCTGTTGCTITCTG; iJo52, 
reverse, AGGGAAATGGTGGGTACACG; iJu44, forward, GAAGAGGCCTC 
ATGGAGGAA; iJu44, reverse, TCTCACAGACCGAAGGGACA,; iJx38, forward, 
AGGACTTGGGGTCATCTCCA; iJ38, reverse, GGCTCCTCAGGACAGAC 
CAG; iJ«31, forward, GCAGCCTGCCAGCTATCTTT; iJa31, reverse, CAAAA 
GCAGCAACCCAACAA; iJx23, forward, AGAGGAGGCCGAAAGTCTCC; 
iJu23, reverse, TGCACAAATCCAGGCCTATG; iJa3, forward, TCTGGACCC 
TTGGCAATCAT; iJa3, reverse, CCCTGCCCTGGTCTACTGTG; iJ52-C6, for- 
ward, GGGTTTTGAGTTGGCTTTGG; iJé2-Cé, reverse, GGCATGTATTTT 
GCGGTTGA. 

ChIP-seq. Single-read high-throughput sequencing libraries were prepared from 
10ng of ChIP DNA and sequenced according to the manufacturer’s protocols 
(Illumina Genome Analyser II). Reads of 38 bases were aligned to the Mus musculus 
mm9 genome assembly using ELAND (Illumina) and wig files generated with 
FindPeaks** were visualized with the UCSC Genome Browser (http://www.genome. 
ucsc.edu/). 

RNA sequencing. 101g total RNA were depleted of ribosomal RNA using 
RiboMinus (Invitrogen). Depletion was verified using an Agilent 2100 
Bioanalyser and the retention of mRNA by real-time RT-PCR. The samples were 
treated with DNasel, and the RNA was fragmented for 2 min at 94°C in RNA 
fragmentation buffer (final concentration: 40 mM Tris acetate, pH 8.2, 100 mM 
potassium acetate and 30 mM magnesium acetate). The reaction was stopped by 
placing the samples on ice and the fragmented RNA was column purified 
(RNAeasy Minelute Kit, Qiagen), eluted in 10 1, and used as template for 
cDNA synthesis using 5g random hexamers in a total volume of 20 ul. 
Second-strand synthesis was performed by adding 91.81 water, 30 ul 5* 
Second strand buffer (Invitrogen), 3 ul 10 mM dNTPs, 4 ul E. coli polymerase I 
(10U wl"), lull E. coli DNA ligase (10U ul!) and 0.2 pl RNase H (10U pl’), 
followed by 2h at 16 °C. 1 pl T4 DNA Polymerase (10 U pl) was added followed 
by an additional 10 min at 16 °C. The double-stranded cDNA was purified using 
the Minelute Reaction Cleanup Kit (Qiagen), eluting twice in 10 ul. Following 
quantification, 20 ng double-stranded cDNA was used for the preparation of a 
single-read high-throughput sequencing library according to Illumina protocols. 
Rearrangement frequency analysis by semi-quantitative PCR. Threefold serial 
dilutions of 100 ng genomic DNA were used for PCR amplification with forward 
primers specific to Va5 or Vu7 and reverse primers specific to Ju56, Jx38 or Jo27, 
respectively”? (this reference refers to Va5 as TRAV3-4 and Va7 as TRADV 15-1). 
Primers for the Calreticulin locus were used as a loading control. PCR products 
were separated by agarose gel electrophoresis and stained with ethidium bromide. 

Vu7, forward, GCAGGATCTAATGTGGCCCAGAAAGTGATTCA;  Va5, 
forward, AGGTGATCACAGAGGCATCCT; Ja56, reverse, ACGTACCTGGTA 
TAACACTCAGAAC; Jau38, reverse, CAAAGACGACTTTGTTGG; Ja27, 
reverse, TTAAGAGCCCAAGCAGATGCATAAG;; calreticulin, forward, TCAT 
GAGTTCCCCACATCTTTG; calreticulin, reverse, CTGCCCTATCCTGAGTC 
TGACA. 
3C. Cells were fixed in 10% FCS, 1% formaldehyde for 10 min at room temper- 
ature (22 °C) and fixation was stopped with glycine (0.125 M). 10’ cells per sample 
were lysed in 10mM Tris, pH 8, 10mM NaCl, 5mM MgCh, 0.2% NP-40 for 
30 min on ice. The nuclei were pelleted and re-suspended in 0.5 ml 1.2 digestion 
buffer (NEB2, New England Biolabs) and permeabilized with SDS (0.5% final 
concentration) for 1h at 37 °C, shaking at 800 r.p.m. and 3.3% Triton X-100 were 
added for an additional 1 h at 37 °C. 2,000 U HindIII (New England Biolabs) were 
added before incubation over night at (37 °C, 800 r.p.m.) and inactivated with SDS 
(1.5%, 65 °C, 30 min). The reaction was diluted in 6.2 ml 1.1X T4 ligase buffer 
(New England Biolabs) and incubated at 37 °C for 1h after addition of 1% Triton 
X-100. 800 U T4 DNA ligase (New England Biolabs) was added for 4h at 16 °C, 
crosslinking was reversed by 300 jig proteinase K (65 °C, 16h). 300 pg RNase A 
was added for 1 h at 37 °C. DNA was isolated by phenol/chloroform extraction and 
ethanol precipitation, quantified using Quant-iT PicoGreen (Invitrogen) and 
200ng DNA were used per TaqMan PCR reaction (QuantiFast, Qiagen). Data 
were normalized to the cross-linking frequency between the anchor and the 
neighbouring HindIII fragment. HindIII digestion efficiency was calculated as 
described” (Supplementary Fig. 8a). The efficiency and linearity of 3C primers 
was tested on templates obtained by HindIII digestion and religation of genomic 
PCR products spanning the HindIII restriction sites. The products of each primer 
pair were gel purified and quantified to calculate copy numbers. Standard curves 
were constructed by tenfold serial dilution (2 X 10° to 20 copies). 3C primers were 
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selected for >90% efficiency and efficiency and a logarithmic coefficient of deter- 
mination (R’) >0.99 (Supplementary Fig. 8b, c). 

Distance from anchor, —75 kb; 3C primer, CTGCCCAAAAGAGAGCTTGG; 
reverse primer, GCATGTGAATGAGAGAGGATGG. Distance from anchor, 
—73kb (TEA); 3C primer, GCGGATTTGGGCAAGAATAA; reverse primer, 
CACAGGCTGCTCCAGAGATG. Distance from anchor, —50kb; 3C primer, 
TCTCGAAGGTCCACAAAGCA,; reverse primer, GGACACGCTTCCTTGGCT 
AC. Distance from anchor, —44kb; 3C primer, AGTCGTGTGGCTGAACCT 
TTC; reverse primer, TGGTGTCTTTATGGGGTTTAGGA. Distance from anchor, 
—26 kb; 3C primer, TTGGCCAGGGGACCATATTA; reverse primer, CTTGCTTC 
CCGTGATGTCTG. Distance from anchor, —19 kb; 3C primer, CACAGGAGAA 
ATGAGGCTCAGA,; reverse primer, CTGACCAGGCCTTCCAGATAA. Distance 
from anchor, — 13 kb; 3C primer, TGTCCTCTGCCACAAGTGCT; reverse primer, 
TAGGAGGACGAGAGGCTGGA. Distance from anchor, 0kb (anchor); 3C 
primer, AGCATGGGAGACAGACGTAATG; reverse primer, ATTCCGGTCTCC 
GTGTGAAT. Normalization fragment, 3C primer, ACCTAGTGACGCCCTG 
GTGA; reverse primer, TTGGCAGGAGTCGGTTCTTT. Distance from anchor, 
8 kb; 3C primer, AATGCGGGCATCTCAGACTT; reverse primer, GAGACTCCC 
AGGCCTTACCC. Distance from anchor, 9kb; 3C primer, GCACTTGTTCTA 
TACTGCCCAAGA,; reverse primer, ATTGTCTGGAACGGCACCA. Distance 
from anchor, 13 kb; 3C primer, ATTGTCTGGAACGGCACCA,; reverse primer, 
GCACTTGTTCTATACTGCCCAAGA. Distance from anchor, 19 kb; 3C primer, 
GGGGGAAAACAACACATCAGA,; reverse primer, GGGTCATCATTCATCCC 
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AGTC. Distance from anchor, 23 kb; 3C primer, TTTAAAAGGCCAGACTGCA 
ACA; reverse primer, CTCCCACATCCATTTCACCA. TaqMan probe, 3C primer, 
[FAM]-CCTGCCTGCCTGAG GACTGC-[BHQ]]. 
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A young source for the Hawaiian plume 


Alexander V. Sobolev'?*, Albrecht W. Hofmann”, Klaus Peter Jochum?, Dmitry V. Kuzmin?> & Brigitte Stoll? 


Recycling of oceanic crust through subduction, mantle upwelling, 
and remelting in mantle plumes is a widely accepted mechanism to 
explain ocean island volcanism’. The timescale of this recycling is 
important to our understanding of mantle circulation rates. 
Correlations of uranogenic lead isotopes in lavas from ocean 
islands such as Hawaii or Iceland, when interpreted as model iso- 
chrons, have yielded source differentiation ages between 1 and 
2.5 billion years (Gyr)”°. However, if such correlations are pro- 
duced by mixing of unrelated mantle components’ they will have 
no direct age significance. Re-Os decay model ages take into 
account the mixing of sources with different histories”*, but they 
depend on the assumed initial Re/Os ratio of the subducted crust, 
which is poorly constrained because of the high mobility of rhe- 
nium during subduction’. Here we report the first data on *’Sr/*°Sr 
ratios for 138 melt inclusions in olivine phenocrysts from lavas of 
Mauna Loa shield volcano, Hawaii, indicating enormous mantle 
source heterogeneity. We show that highly radiogenic strontium in 
severely rubidium-depleted melt inclusions matches the isotopic 
composition of 200-650-Myr-old sea water. We infer that such sea 
water must have contaminated the Mauna Loa source rock, before 
subduction, imparting a unique ‘time stamp’ on this source. Small 
amounts of seawater-derived strontium in plume sources may be 
common but can be identified clearly only in ultra-depleted melts 
originating from generally highly (incompatible-element) depleted 
source components. The presence of 200-650-Myr-old oceanic 
crust in the source of Hawaiian lavas implies a timescale of general 
mantle circulation with an average rate of about 2 (+1) cmyr', 
much faster than previously thought. 
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Figure 1 | Compositions of lavas and melt inclusions in olivine phenocrysts 
from recent (younger than 50 kyr) eruptions of Mauna Loa volcano, Hawaii. 
Data for Puu Wahi lava and melt inclusions (MI) are from this study; other melt 
inclusions are from ref. 13 and unpublished data of A.V.S. UDM inclusions are 
outlined. Compositions of Mauna Loa lavas are from the GEOROC database 
(http://georoc.mpch-mainz.gwdg.de/georoc/). All ratios are normalized to 
primitive mantle*’. 


Melt inclusions in highly magnesian olivine are commonly used as 
proxies for parental melts’®. In addition, melt inclusions provide 
unique information on the isotope heterogeneity of the mantle sources 
of ocean island basalts'’’*. Compositions of melt inclusions in olivine 
(Supplementary Figs 1 and 2) from lavas of the largest Hawaiian shield 
volcano, Mauna Loa, vary considerably’* (Fig. 1). Both La/Sm and Sr/ 
Ce ratios of melt inclusions have very large (tenfold) variations, and 
this heterogeneity is much greater than those displayed by Mauna Loa 
bulk lava compositions. About 2-3% of melt inclusions are strongly 
depleted in the highly incompatible elements Th, U, Ba, Rb, K, Sr, Nb, 
Ta, Cl, B, Pb, S, Zr, Hf and the light rare-earth elements (Figs 1 and 2 
and Supplementary Table 2). Here we will call these inclusions “ultra- 
depleted melts’ (UDMs). In contrast with typical Mauna Loa melts, 
UDMs are also slightly depleted in the moderately incompatible ele- 
ments Ti, Nd, Sm and Al and are enriched in SiO;, but are similar in 
contents of heavy rare-earth elements, Y, Sc and Ca (Fig. 2, Sup- 
plementary Fig. 3 and Supplementary Table 2). Strontium is signifi- 
cantly depleted relative to Pr, Nd and Sm. UDM inclusions have been 
reported in association with ‘normal’ inclusions in the same olivine 
grains’*. In such an association, inclusions of mixed composition are 
commonly present. Mauna Loa UDMs have been interpreted’ as 
instantaneous melts produced by near-fractional melting of Hawaiian 
source by a process that is similar to that described for mid-ocean-ridge 
UDM™. The isotope data presented here disprove this interpretation, 
showing that these UDMs also have significantly different source com- 
positions from those of other parental melts. 

Strontium and lead isotopic compositions of melt inclusions in 
olivine phenocrysts from a single Mauna Loa lava (see Supplemen- 
tary Figs 1 and 2 for inclusion images) were measured by laser ablation 
inductively coupled plasma mass spectrometry (LA-ICP-MS; see 
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Figure 2 | Primitive mantle*! normalized concentrations of incompatible 
elements in melt inclusions in euhedral olivine crystals from a single sample 


(K97-15b) of Puu Wahi scoria cone, Mauna Loa, Hawaii. Ultra-depleted melt 
inclusions are labelled UDM. 
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Methods for sample description and analytical details). In total, 138 
inclusions were analysed for Sr isotopes, and 106 inclusions for Pb 
isotopes. Both Sr and Pb isotopic compositions of melt inclusions vary 
markedly, much more than in Hawaiian lavas overall. But their 
averages are nevertheless similar in compositions to those of matrix 
glass and of typical recent Mauna Loa lavas (Fig. 3). The range of 
variation is markedly higher for melt inclusions trapped in the most 
magnesian olivines (Supplementary Fig. 4). This is consistent with the 
interpretation that melt inclusions in early-formed olivine phenocrysts 
yield information on the compositions of unmixed parental melts, 
whereas lavas and inclusions in more evolved olivines are mixtures of 
these melts’*’’. Clear evidence of such mixing is presented by the most 
radiogenic melt inclusions with *’Sr/*°Sr > 0.7060. These inclusions con- 
sistently fit mixing trajectories for almost all incompatible elements and 
Sr isotopes, thus strengthening confidence in the reliability of the isotope 
data (Supplementary Fig. 5). UDMs represent the endmember of such 
mixing combining highly radiogenic Sr (°’Sr/*°Sr = 0.7081 + 0.0006, 
2c), much higher than the value of typical Mauna Loa melts 
(°’Sr/*°Sr ~ 0.7038), with Rb/Sr ratios (0.010-0.013) that are unusually 
low for Mauna Loa (Fig. 3a). Lead isotope compositions of UDMs are 
also significantly more radiogenic than those of typical Mauna Loa melts 
(Fig. 3b; note that the lowest 207Db/7°6Pb ratios correspond to the most 
radiogenic Pb). These data clearly suggest that the UDMs could not 
have been produced from any typical Mauna Loa source by any melt- 
ing process. Instead, the source rock must contain a component with 
highly radiogenic Sr and Pb, but must at the same time be strongly 
depleted in incompatible elements. Alternatively, this component 
might have been introduced by some type of contamination of the 
primary melt. Next we discuss possible sources. 

Before presenting our preferred interpretation, namely source con- 
tamination by ancient sea water before subduction, we discuss alterna- 
tive explanations. In general, the highly radiogenic Sr of Mauna Loa 
UDMs might be explained by one of the following processes, all of 
which have significant drawbacks. 

First, direct input of recycled continental crust could in principle 
explain highly radiogenic Sr, but this is completely inconsistent with 
the extremely low concentrations of Rb, Ba, K, U and Pb, as well as 
their high Ce/Pb (50-80) and Nb/U (41-46) ratios”. 


Second, continental Sr might be transferred to the mantle source 
during subduction more indirectly by the percolation of crustal fluids 
without necessarily causing trace-element enrichment, if the mantle 
lithology is a depleted harzburgite containing no clinopyroxene. This 
process requires that, in addition to Sr, all other incompatible elements 
including Sc, Ca and Al in UDMs come from a clinopyroxene-free 
harzburgite source. However, this contradicts the relatively high con- 
tents of moderately incompatible elements (Ti, Al, Ca and rare-earth 
elements heavier than Sm) in UDMs that are similar to those of typical 
Mauna Loa melts (Fig. 2 and Supplementary Fig. 3), because harzbur- 
gites are typically depleted in all these elements. Additional observa- 
tions that are very difficult to reconcile with any continental source, 
even in a markedly ‘fractionated’ form, are as follows: the very low Pb 
concentrations and high Ce/Pb ratios, as well as high Nb/U and Nb/Th 
ratios, are the opposite of what would be expected from a metasomatic 
(fluid) transfer of a sedimentary signature from a subducted slab or 
from recycled sediments (see Supplementary Information); and other 
ratios of trace elements of similar incompatibility (Ba, Rb, Th, U and 
Nb) are similar (though not identical) to those of oceanic basalts, in 
particular Hawaiian basalts, but quite different from those of contin- 
ental materials. 

Third, the radiogenic Sr might be derived directly or indirectly from 
modern sea water. Producing UDM by contamination with typical 
Mauna Loa melt is highly unlikely because such a contamination pro- 
cess cannot result in tenfold lower concentrations of most incompatible 
elements. Seawater assimilation would be somewhat more plausible ifit 
affected the original, highly depleted melt (rather than normal Mauna 
Loa melt. Perhaps the most efficient mechanism for this would be the 
assimilation of recent aragonitic carbonate containing a high concen- 
tration of radiogenic Sr and depleted in all incompatible elements’®. 
This process could add radiogenic Sr without significantly changing 
most of the incompatible-element budget of the melt. However, it 
would also increase Sr and Ca concentrations considerably. For 
example, a typical Mauna Loa melt with 87$1/8°Sr = 0.7040 would 
require the assimilation of about 5% high-Sr carbonate to match the 
isotopic composition of UDM. This would result in a sixfold increase in 
Sr concentration of UDM if the carbonate contaminant contained 
1,000 p.p.m. Sr, with *’Sr/*°Sr = 0.7090. This implies an unreasonably 


Figure 3 | °’Sr/*°Sr ratios in melt inclusions and 
matrix glass in euhedral olivine crystals of 
sample (K97-15b), Puu Wahi scoria cone, Mauna 
Loa, Hawaii. a-d, °’Sr/*°Sr ratios are plotted 
against Rb/Sr (a), *°’Pb/°°Pb (b), Sr (c) and Cl 
(d). Compositions of bulk Mauna Loa lavas from 
the GEOROC database are shown (in orange) for 
comparison. The compositions of average UDM 
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large negative Sr anomaly (half the primitive mantle value of Sr) in the 
initial, uncontaminated melt. Also, this process must increase the CaO 
content of the UDMs by at least 1.5 wt%, which is not observed (see 
Supplementary Fig. 3). Alternatively, seawater Sr might enter the melt 
by assimilation of highly altered ocean crust (including sediment). This 
could explain highly radiogenic Sr (Fig. 4) and low concentrations of 
incompatible elements, as well as more radiogenic Pb’’. However, this 
should also incorporate other seawater-specific components, particu- 
larly B, Cl and K, because these elements are strongly enriched in 
altered crust'*. But this is inconsistent with the extremely low concen- 
trations of Cl (10-15p.p.m.), B (0.3-0.4p.p.m.) and K,O (0.05- 
0.07 wt%) in the UDMs (Fig. 3d and Supplementary Table 2). Even if 
one assumes that these volatile elements were extracted from the 
assimilated crustal rocks by previous high-temperature metamorphism, 
this process still seems unlikely. To raise the originally low *’Sr/*°Sr 
ratio of the melt to the highly radiogenic value observed, the amount 
of such assimilated material would have to be large. Such extensive 
assimilation would also affect concentrations of Ca, the heavy rare- 
earth elements, Y and Sc, all of which are relatively high and very 
similar in UDMs and typical Mauna Loa lavas (Fig. 2, Supplemen- 
tary Fig. 3 and Supplementary Table 2). Alumina is slightly depleted 
in UDMs in comparison with Mauna Loa melts (Supplementary Fig. 3), 
and this also precludes the assimilation of any plagioclase-saturated, 
high-Al rocks, typical of oceanic crust. Similarly, we can rule out the 
assimilation of seawater-altered Hawaiian lavas, because of their high 
incompatible-element enrichment. 

Reaction with altered harzburgites and their partial melting may also 
deliver highly radiogenic modern seawater Sr to depleted Mauna Loa 
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Figure 4 | *’Sr/*°Sr ratios of the two most radiogenic Mauna Loa melt 
inclusions superimposed on the time evolution of Sr isotopic composition 
of sea water. a, Seawater evolution for the past 2 Gyr (ref. 22). b, Detailed Sr 
evolution for the past 1 Gyr (refs 19, 22). The green diamond indicates seawater 
composition at 90 Myr (ref. 19) corresponding to the age of lithosphere under 
the Big Island, Hawaii. The horizontal dashed line represents the pooled 
average *’Sr/*°Sr ratio of 21 analyses of two UDM inclusions. The dotted line 
and heavy black line indicate the upper and lower limits of range of two 
standard errors of the average *’Sr/*°Sr, respectively. We suggest that the 
radiogenic Sr in UDMs is derived from ancient sea water in the Mauna Loa 
source (see the text). The blue field represents the acceptable age range for the 
recycled seawater component. We reject ages marked by the red field because 
we have ruled out recent seawater contamination (see the text). We consider the 
lower limit of the shown range (°7Sr/*°Sr = 0.7075, indicated on both panels) to 
be a minimum estimate for the composition of the seawater component in the 
Mauna Loa source, because its isotopic composition must have been diluted by 
less radiogenic components from the original unaltered basalt and source 
peridotite. The most likely age of recycled seawater Sr is in the range 200- 
550 Myr, but it may be extended to 650 Myr (ref. 23). 
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melt. But this process will also affect concentrations of Ni, Mn and Fe, 
which is not the case (Supplementary Fig. 8). 

An additional argument against contamination by the present-day 
lithosphere beneath Hawaii is derived from the fact that this litho- 
sphere is slightly more than 90 Myr old. At the time that it was created, 
sea water had a §’Sr/**Sr ratio of 0.7074 or less (ref. 19), just below the 
lower limit of UDM Sr (Fig. 4). Because seawater Sr is introduced into 
the altered crust almost entirely during the first 3 Myr after crust 
formation”, it is highly unlikely that Sr contamination derived from 
90-Myr-old lithosphere can account for the observed composition 
(°’Sr/°°Sr = 0.7075) of the UDMs. 

Because none of the above processes provides a satisfactory explana- 
tion for the presence of highly radiogenic Sr and Pb in the UDM, we 
suggest that the radiogenic Sr was introduced through direct or indirect 
contamination by ancient sea water before subduction and recycling. 
In this case, other indicators of sea water or seawater alteration, K, B 
and Cl, should easily be removed from the system by mobilization 
during subduction”’. In addition, Ca, Al, rare-earth elements, Sc and 
Y in the melt would be constrained by the mineralogy of the mantle 
residue, specifically by the presence of garnet and high-Ca pyroxene. 
However, Sr in ancient sea water was, during most of Earth’s history, 
much less radiogenic than Sr in the UDMs, except for one specific time 
window, namely the period between 200 and 550 Myr ago”? (Fig. 4). 
According to other estimates” this period may be extended at 
maximum to 650 Myr. Therefore, if the effect is indeed caused by 
recycled seawater Sr, this sea water is unlikely to be older than 
650 Myr old. 

We propose that Mauna Loa UDMs reveal one of the recycled end- 
member compositions similar to ultra-depleted mid-ocean-ridge 
basalt (MORB)" or gabbros™ with Sr extensively exchanged with, 
and dominated by, seawater Sr because of its initially low content 
(see model in Methods). This explanation is also consistent with the 
strong negative Zr anomaly in Mauna Loa UDMs, which is a common 
feature of ultra-depleted MORB as well’. Highly depleted gabbronorites 
crystallized from remelted material of this kind possessing *’Sr/*°Sr up 
to ~0.708 have been reported in oceanic crust exposed in the Oman 
ophiolite*’. The Sr isotopes of the other recycled rocks (typical gabbros or 
normal MORBs) indicated by the compositions of melt inclusions 
(Fig. 3) were less affected by seawater alteration because of their 
higher Sr contents and/or different positions in the crust'®. The Sr-rich 
component of Mauna Loa lavas is particularly unradiogenic 
(°’Sr/*°Sr < 0.7030; see Fig. 3c), in agreement with its origin from 
recycled oceanic plagioclase-cumulate gabbros’’. Alternatively, sea- 
water Sr might be introduced into the subduction ‘package’ by means 
of serpentinites or carbonate veins that are ubiquitous in the oceanic 
crust. Particularly when precipitated at low temperatures, they pre- 
serve the Sr isotopic composition of sea water'®. We further suggest 
that all these recycled materials have been processed during subduction 
to remove most of the additional Cl, B, Rb and K introduced by sea 
water. 

Previous estimates of recycling times in mantle plumes have mostly 
relied on Pb isotopes. If an initially homogeneous source is differentiated 
(for example, by melting) into subsystems with variable U/Pb ratios, these 
subsystems evolve to a linear isochron in 7” Pb/?**Pb—*°°Pb/™ Pb space. 
Such linearly correlated Pb isotopes are widespread in oceanic basalts 
and have frequently been interpreted as isochrons corresponding to 
recycling ages ranging from 1.0 to 2.5 Gyr (ref. 2). However, they might 
also simply represent mixing lines between unrelated reservoirs’. It has 
been pointed out’® that the isochron interpretation fails when Th/U 
ratios inferred from the slope of 7°°Pb/?™*Pb-7°°Pb/*™ Pb correlations 
are inconsistent with observed Th/U ratios in the rocks, and it was 
concluded that the Pb isotope correlations for Mauna Kea must rep- 
resent mixing lines rather than isochrons. 

All Mauna Loa olivine phenocrysts possess Ni excess and Mn defi- 
ciencies, which suggests the involvement of melts from olivine-free 
reaction pyroxenite and peridotite sources in proportions of about 
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60:40, respectively*’”* (Supplementary information). From one-half to 
two-thirds of this reaction pyroxenite is derived from high-silica melt 
from recycled crust in the form of eclogite, with the other part being 
peridotite’’. In total this yields 60-70% peridotite component in the 
source of Mauna Loa magma. However, because 87¢7/°°Sr in the 
UDMs is nearly at the maximum possible value for sea water’’, this 
implies that most of the Sr, as well as other incompatible elements, 
came from the recycled crustal component. This leaves very little room 
for peridotite-derived incompatible elements, and thus requires that 
the peridotitic component was severely depleted in these elements, 
including Sr, so that it did not significantly affect the incompatible 
element budget of the final melt (see Methods and Supplementary 
Information for the quantitative model). Alternatively, peridotite 
could be moderately depleted in incompatible elements, but it would 
contain radiogenic Sr resulting from seawater alteration, and this 
would represent part of the recycled oceanic lithosphere. 

Our interpretation of the UDM compositions implies an age for the 
recycled component in the Mauna Loa source of between 200-650 Myr, 
much younger than previously suggested source differentiation ages of 
deep mantle plumes**”*. Such a young age is consistent with a timescale 
of general mantle circulation with an average rate of about 1-3 cm yr ~ I 
assuming that subducted crust was delivered to the core-mantle boundary 
at 2,900 km depth, and the Hawaiian plume rises from that depth. 


METHODS SUMMARY 


The picritic sample K97-15b from Puu Wahi scoria cone (age 910 yr), Mauna Loa 
volcano, Hawaii, contains euhedral olivine grains (Fo 89-82) with naturally 
quenched melt inclusions and matrix glass. 

Major and some trace elements in melt inclusions, glass and host olivines were 
determined by electron probe microanalysis on a Jeol JXA 8200 SuperProbe 
Electron Probe Microanalyser at the Max Planck Institute for Chemistry 
(MPIC) Mainz, Germany, with a 2¢ relative error of 1-2%. The compositions of 
olivine and contents of Cl, S, Ni and Cr in inclusions were analysed by following a 
special procedure that allows precision and accuracy of 20-30 pg g' (26 error) for 
Ni, Ca, Mn, Al, Ti, Cr and Co, 0.02 mol% for the forsterite component in olivine”’, 
4yigg | for Cl and 20 ugg ' for S in glass. 

LA-ICP-MS was used to determine trace elements in glasses of melt inclusions 
onan Element-2, Thermo Scientific mass spectrometer with a UP-213 New Wave 
Research solid-phase laser at MPIC, with a precision and accuracy of about 10- 
15% (2 relative standard deviations (r.s.d.)). Boron concentrations were measured 
by secondary-ion mass spectrometry on a Cameca Ims-3f ion probe at MPIC with 
an accuracy and precision of about 20% (2 1.s.d.). 

In situ Sr and Pb isotope analyses of melt inclusions were performed at MPIC 
with the ICP mass spectrometer Thermo Element-2 and the New Wave UP 193 
laser ablation system with a precision and accuracy of about 0.02-0.10% (1 r.s.d.) 
for Sr (new data and ref. 29) and 0.1% 7°8Pb/?°°Pb and 7°’Pb/?°°Pb (ref. 30) for 
one-to-three-spot analyses of melt inclusions. 

Quantitative modelling suggests that highly radiogenic Sr and the chemical 
composition of Mauna Loa UDMs can result from recycling, melting and reacting 
of depleted oceanic crust altered by sea water and depleted mantle peridotite. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Samples. The sample K97-15b has been collected by Alfred Anderson at Puu 
Wahi, from a scoria cone of 910-year-old picritic lava situated at about 3,000 m 
elevation on the northeast rift zone of Mauna Loa volcano, Hawaii. The melt 
inclusions hosted by euhedral olivine grains (Fo 89-82) of millimetre size were 
naturally quenched during eruption and formed fresh glass and small (a few 
volume per cent) low-density shrinkage bubbles (Supplementary Fig. 1). 
Isotope analyses. In situ Sr and Pb isotope analyses of melt inclusions were 
performed at the Max Planck Institute for Chemistry in 2006, with the ICP mass 
spectrometer Thermo Element-2 and the New Wave UP 193 laser ablation system 
(wavelength 193nm, energy density 4 Jem’, spot sizes 501m (Sr) and 75- 
100 ttm (Pb), pulse repetition rate 10 Hz)””*°. In addition, Sr was remeasured in 
October 2010 and February 2011 after improvements of the detection system, to 
reaffirm and improve the overall reliability of the Sr isotope results. 

The precision and accuracy of our LA-ICP-MS method in measuring *’Sr/*°Sr 
ratios was constrained by analysing well-documented reference glasses (see 
Supplementary Table 1). The external precision of data obtained in 2006 was 
reported in ref. 29 as the standard deviation (s.d.) of sample measurements and 
is a strong function of Sr content of glasses (see Supplementary Fig. 6). From this 
equation (obtained for the reference glasses) one can predict s.d. = 0.0010 for 
UDMs containing 65 p.p.m. Sr. For the nine individual UDM measurements in 
2006 the corresponding predicted standard error of the mean (s.e.m.) is 0.0003. 
These predicted values are slightly smaller than the actually observed s.d. of 0.0016 
and 0.0005 for UDMs. This result is to be expected because reference glasses are 
significantly larger in all three dimensions than the measured inclusions and 
generally yield a somewhat larger number of total counts, yielding slightly higher 
precision. 

To substantiate our results and further improve precision we reanalysed (in 
October 2010) the Sr isotope ratios of 21 melt inclusions, including both UDMs, 
using a significantly upgraded LA-ICP-MS system providing a combined effect of 
an improved stability and higher sensitivity. The improvements include a new 
multiplier system, a new ion detection unit and front-end computer of the ICP 
mass spectrometer, and new adjustment of the laser including cleaning of the 
valves and tubing in the laser ablation system. These technical improvements 
led to a laser fluence that was higher (about 10J cm 7 in comparison with about 
4Jcm *) and more uniform (decrease to about 2% during 4h of operation, from 
about 25% before the improvements), higher counting rates (by about twofold) 
and lower blanks (for example, for **Sr about 70c.p.s. in comparison with 
300 c.p.s.”’). 

The higher precision and accuracy of the new data, for low-Sr samples, is clearly 

shown by new analyses of the reference glasses (Supplementary Fig. 6 and 
Supplementary Table 1), performed concurrently with those of the inclusions. 
Note that all natural reference glasses yield means of measured *’Sr/*°Sr ratios within 
one standard error of the reference value, which proves that the standard error is an 
appropriate measure of both precision and accuracy. The predicted maximum pre- 
cision (and accuracy) of ®7S1/°°Sr ratios for the new measurements of UDM inclu- 
sions, estimated from these data, is s.d. = 0.0004 for an individual measurement and 
s.e.m. = 0.0001 for the mean of 12 measurements. A significant advance in precision 
of UDMs and reference glasses with similar low Sr contents is evident from 
Supplementary Tables 1 and 3. All reanalysed melt inclusions match the older data 
set very well but with better precision (see Supplementary Fig. 7). The new data for 12 
individual measurements of the UDM yield *’Sr/*°Sr = 0.7082 + 0.0006 (two 
s.e.m.). If we pool the old and the new data, we have 21 measurements yielding a 
virtually identical mean value of 876r/°°Sr = 0.7081 + 0.0006 (two s.e.m.). Although 
the precision of the new data are somewhat better than that of the older analyses, we 
use the full data set of 21 analyses because the two data sets, separated by a time gap of 
four years, show excellent consistency. 
Analysis of major and trace elements. Major and trace elements were deter- 
mined by electron probe microanalysis on a Jeol JXA 8200 SuperProbe Electron 
Probe Microanalyser at MPIC, Mainz, Germany. Major-element abundances in 
glasses were measured at an accelerating voltage of 15 kV and a beam current of 
12nA with a reference sample of natural basaltic glass USNM111240/52 (VG2)” 
with a relative error of 1-2%. The compositions of olivines and Cl, S, Ni and Cr in 
glasses were analysed at an accelerating voltage of 20kV and a beam current of 
300 nA, by following a special protocol’® that allowed 20-30 p.p.m. (20 error) 
precision and accuracy for Ni, Ca, Mn, Al, Ti, Cr and Co, and 0.02 mol% for the 
forsterite component in olivine and 20 p.p.m. for Cl and S in glass. 

Trace elements in glasses of melt inclusions were determined by LA-ICP-MS 
on an Element-2, Thermo Scientific mass spectrometer with a UP-213 New 
Wave Research solid-phase laser at MPIC, with reference to the KL-2G and 
NIST 612 standard glasses*’ (see http://georem.mpch-mainz.gwdg.de). Ca was 
used as a reference element. The typical conditions were: laser diameter 60- 
80 jum, energy density about 7Jcm *, pulse repetition rate 10 Hz and ablation 
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time 60-80s. The element abundances were determined with 2¢ errors of no 
more than 5% and 10% for concentrations above 1p.p.m. and ~0.1p.p.m., 
respectively. 

Boron content in glasses of melt inclusions was measured by secondary-ion 

mass spectrometry on a Cameca Ims3F ion microanalyser at MPIC with reference 
to the KL-2G and NIST 612 standard glasses by following the protocol’ with a 
relative error commonly within 10%. The detection limit for B, estimated from lp 
intensity on the host olivine, was below 0.02 p.p.m. 
Modelling. According to our model, the parental Hawaiian melts are mixtures 
of melts produced by the melting of olivine-free reaction pyroxenite and peri- 
dotite in which the proportion of pyroxenite-derived melt (Xpx) is estimated 
from the olivine composition’’”*. The pyroxenitic source is generated by reac- 
tion between high-Si, eclogite-derived melt and peridotite, with the proportion 
of reactive melt (¢) similar to the original amount of olivine in peridotite’’”. The 
other variables that constrain the composition of the final melt are: degrees of 
melting of eclogite (F;) and peridotite (Fp,), their chemical, isotopic and phase 
compositions, the degree of melting of pyroxenite (Fpx), melting reactions, and 
partition coefficients between melt and crystals. Mass conservation requires the 
following relations between the contents of bulk Sr, 86sr and °’Sr in the final 
mantle-derived, primary melt (PM), peridotite (PE) and eclogite (E) for a batch 
melting process: 


: : Xpx x (1—@) (1—Xpx) 
'Stpy = !Stpp % Li Sr Lifes Sr 
Fox +(1—Fpx) x Kpx Fog + (1 — Foe) x Kop (1) 
isr, X gx Xvx 
Sr; 5 
© (Fox + (1 —Fex) x K8k) x (Fe + (1 — Fg) x Ke") 


where i = 86 or 87, nd (bulk Sr) Kpx, Kp, and K;" are bulk distribution coeffi- 
cients for Sr or its isotopes between crystal phases and melt for pyroxenite, 
peridotite and eclogite, respectively. 
8655. St 
: 87S rj /86Sxj + (84S) + 8°Sxj + 88S1;)/8°Sx) 


(2) 


where j=PM, PE or E, and the ratio of unradiogenic stable isotopes 
(Sr; + 9S; +°8Sr))/*Sr; = 9.43205. 

Equation (1) for Sr, °’Sr and ®°Sr and equation (2) for primary melt, eclogite and 
peridotite give six equations with nine unknowns (Srp,, 87Stpp, “°Stpp, Stpmp 
®7Srpap *°Srpm, Srp, °’Srp and *°Srg). To resolve these equations one needs to 
constrain three unknowns. The seawater model presented in this paper allows a 
maximum *’Sr/*°Sr ratio of recycled eclogite of 0.7090 (first constraint), while the 
measured composition of the UDMs requires the minimum *’Sr/*°Sr ratio of the 
final melt to be 0.7075 (second constraint; see Fig. 4). The composition of the 
UDMs also allows the composition of primary UDM to be estimated by reversing 
olivine fractionation up to equilibrium with the most magnesian olivine for Mauna 
Loa (Fo 90.7; ref. 27). This yields Srpy = 51 p.p.m. Sr in the primary UDM melt 
(third constraint). 

Solving mass balance equations (1) and (2) for the above constraints and using 
reasonable values of g (0.60-0.65), Fy (0.5), Fpz (0.10-0.15), Fpx (0.45), Xpx (0.60), 
Kx =0.05, Ke =0.01 and Ke =0.15 (see the discussion on these parameters in 
ref. 27) gives a relation between contents and isotope composition of Sr in the 
peridotite component (Supplementary Fig. 9). Assuming a lower limit for the 
*’Sr/°°Sr ratio of peridotite of 0.7021 (extreme depletion) yields a maximum Sr 
content of peridotite of about 3 p.p.m., which fulfils the conditions stated above. 
Thus, as expected, the peridotite source should be more depleted in Sr than average 
depleted MORB mantle with a Sr content of 7.66 p.p.m.™. For a lower content or 
more radiogenic Sr in peridotite, the °’Sr/*°Sr in the recycled crust would be lower 
than 0.7090. It approaches 0.7075 (minimum allowed by the observed UDM 
composition) when there is no Sr in peridotite or the ratio *’Sr/**Sr in peridotite 
is 0.7075. 

The above model is somewhat oversimplified because it uses a batch melting 
process and considers fixed melting phase proportions in the form of constant bulk 
distribution coefficients. To take into account a more realistic, aggregated critical 
melting process and include changing phase proportions, we further use the con- 
ceptually similar but more advanced model with melting reactions and distri- 
bution coefficients explained in ref. 27. Using the above estimates for maximum 
Sr content in peridotite we chose for our quantitative modelling the composition of 
depleted abyssal harzburgite from the database*’ with a Sr content of 2.73 p.p.m. 
(model 1), and a restite after 2% of near fractional melting of depleted MORB 
mantle** with a Sr content of 1.96 p.p.m. (model 2). The results of modelling are 
shown in Supplementary Fig. 10 and Supplementary Table 4. 

The composition of recycled crust was calculated to match the trace-element 
composition and minimum *7Sr/*°Sr = 0.7075 ratio of UDM primary melt. For 
87S1/°°Sr = 0.7021 of peridotite, the results for recycled crust are *’Sr/*°Sr = 0.7086 
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(model 1) and 0.7083 (model 2). The estimated trace-element compositions of 
recycled crust correspond to highly depleted oceanic crust and are reasonably close 
to reported compositions of depleted MORBs**”” (Supplementary Fig. 10a). The 
compositions of peridotites correspond to depleted abyssal harzburgite or depleted 
lherzolite (Supplementary Fig. 10a). We conclude that the highly radiogenic Sr and 
the chemical composition of Mauna Loa ultra-depleted melts can be produced by 
recycling, melting and reacting of depleted oceanic crust altered by sea water and 
depleted mantle peridotite. This peridotite may be also a part of the recycled 
oceanic lithosphere. 

We further investigate an alternative possibility for the production of highly 
radiogenic Sr in ultra-depleted melt by the involvement of small amounts of 
continental sediments with an extremely high *’Sr/*°Sr ratio. We use the same 
model as model 1 but for °’Sr/*°Sr = 0.7025 of recycled oceanic crust and variable 
amounts of sediment with the most extreme *’Sr/*°Sr = 0.7349 (from Sumatra”*). 
This yields an amount of 1.6% of such sediment in the recycled component to 
match °’Sr/*°Sr = 0.7075 of the final melt. However, this substantially increases 
the concentration of highly incompatible elements such as Th and La of the final 
melt and significantly lowers its Nb/Th ratio (Supplementary Fig. 10d and 
Supplementary Table 4). We conclude that recycled continental material, even 


with extremely radiogenic Sr, cannot produce the highly incompatible trace- 
element pattern of Mauna Loa UDMs. 
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Structure of the membrane domain of 
respiratory complex I 


Rouslan G. Efremov'} & Leonid A. Sazanov! 


Complex I is the first and largest enzyme of the respiratory chain, coupling electron transfer between NADH and 
ubiquinone to the translocation of four protons across the membrane. It has a central role in cellular energy 
production and has been implicated in many human neurodegenerative diseases. The L-shaped enzyme consists of 
hydrophilic and membrane domains. Previously, we determined the structure of the hydrophilic domain. Here we 
report the crystal structure of the Esherichia coli complex I membrane domain at 3.0 A resolution. It includes six 
subunits, NuoL, NuoM, NuoN, NuoA, NuoJ and NuoK, with 55 transmembrane helices. The fold of the homologous 
antiporter-like subunits L, M and N is novel, with two inverted structural repeats of five transmembrane helices 
arranged, unusually, face-to-back. Each repeat includes a discontinuous transmembrane helix and forms half of a 
channel across the membrane. A network of conserved polar residues connects the two half-channels, completing 
the proton translocation pathway. Unexpectedly, lysines rather than carboxylate residues act as the main elements of 
the proton pump in these subunits. The fourth probable proton-translocation channel is at the interface of subunits N, K, 
J and A. The structure indicates that proton translocation in complex I, uniquely, involves coordinated conformational 


changes in six symmetrical structural elements. 


Complex I (NADH:ubiquinone oxidoreductase) is one of the largest 
known membrane proteins. It provides about 40% of the proton flux 
during proton-motive force generation for the synthesis of ATP by the 
mitochondrial respiratory chain’°. Many mutations in subunits of 
complex I have been associated with human neurodegenerative 
diseases*®. Complex I is a major source of reactive oxygen species in 
mitochondria: these can damage mitochondrial DNA and are con- 
sidered to be causative agents in Parkinson’s disease’ and ageing’. 
Mitochondrial complex I consists of 45 subunits (980 kDa in total)’. 
The simpler prokaryotic enzyme usually consists of 14 ‘core’ subunits 
(about 550kDa in total) that are conserved from bacteria to 
humans'**°, It contains equivalent redox components and has a 
similar L-shaped structure to that of mitochondrial complex I 
(refs 2, 5, 11). The high degree of sequence conservation of core 
subunits indicates that the mechanism of complex I is probably the 
same throughout all species. 

The complete atomic structure of this molecular machine is 
currently unknown. Structures of the Thermus thermophilus complex 
I hydrophilic domain’*”* have established the electron-transfer path- 
way from NADH to the primary electron acceptor flavin mononu- 
cleotide, and then through seven conserved iron-sulphur clusters to 
the putative quinone binding site at the interface with the membrane 
domain (which lacks covalently bound prosthetic groups). The three 
largest hydrophobic subunits of complex I, NuoL, M and N (E. coli 
nomenclature), are homologous to each other and to Na‘/H* anti- 
porter complex (Mrp) subunits’*’’. They are likely to participate in 
proton translocation. We previously determined the arrangement of 
a-helices in the membrane domain of E. coli complex I, but the limited 
resolution (3.9 A) prevented sequence assignment". 

Understanding how the transfer of two electrons from NADH to 
ubiquinone is coupled to the translocation of four protons (the 
current consensus value) across the membrane remains a major ques- 
tion in complex I research’ >». ‘Direct’ (redox-driven) and ‘indirect’ 


(conformation-driven) mechanisms have been proposed***'®. Recently 
published structures revealed an unexpected structural element, helix 
HL, extending along nearly the entire length of the membrane 
domain’*”’: this may act as a connecting element, coordinating con- 
formational changes''. A greater understanding requires a knowledge 
of the atomic structure of the membrane domain. 


Structure determination and overall architecture 


A new PI crystal form was obtained with protein purified in the 
detergent cymal-7. After extensive optimization, the diffraction 
extended to 2.7 A in the best direction. The crystallized fragment 
contains six subunits, NuoL, M, N, A, J and K (222kDa in total), 
but lacks NuoH, which dissociates readily from the complex’*’*. 
Multiwavelength anomalous diffraction (MAD) data sets were col- 
lected from SeMet-derivatized protein (Supplementary Table 1). 
Multi-crystal density averaging and modification led to experimental 
electron density maps of excellent quality (Supplementary Fig. 1). The 
model was refined to 3.0A resolution with R=23.2% and 
Réree = 28.3% (Supplementary Table 1), and includes 1,952 residues 
(out of 2,038, Supplementary Table 2). A cymal-7 molecule, frag- 
ments of lipid aliphatic chains and several well-ordered internal water 
molecules are also included. 

The model contains 55 transmembrane helices (TMs) (Fig. 1 and 
Supplementary Movie 1). Subunits NuoA, J and K form an 11-helix 
bundle near the interface with the hydrophilic domain, and the 
antiporter-like subunits NuoN, M, and L follow, as proposed earlier’. 
The distal subunit NuoL contains a carboxy-terminal extension that 
starts with TM15, followed by helix HL and ending with TM16, 
harboured at the interface with subunits NuoJ, K and N. Sequence 
conservation is high for the membrane core of all subunits (Sup- 
plementary Fig. 3), underlining its functional significance. Helix HL 
is poorly conserved, as might be expected for a mostly mechanical 
element, with the exception of residues contacting other subunits 
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Figure 1 | Architecture of the membrane domain of E. coli complex I. 
Cartoon representation of the atomic model. a, Side view in the membrane 
plane. b, View from the periplasm into the membrane. Subunits are coloured as 
indicated by the labels. Discontinuous helices are shown in red (TM7) and 
orange (TM12). Connecting elements, helix HL and f-hairpins (f-h), are 
labelled. The position of the lipid bilayer is estimated as in Supplementary Fig. 
4. c, Side view of the structure, aligned with the membrane domain of the entire 
complex I from T. thermophilus (PDB code 3M9S)"". Subunits that are not 
present in the E. coli structure are shown in grey. Flavin mononucleotide 
(FMN) is shown as magenta spheres; iron-sulphur clusters are shown as red- 
orange spheres, with cluster N2 labelled. Helix TM1 from subunit Nqo8/NuoH, 
helix H1 from Nqo6/NuoB and the probable quinone-binding site (Q) are 
indicated. 


(Supplementary Fig. 3b), indicating that this helix is similarly 
arranged across species. The polar residues of amphiphatic helix HL 
face the cytoplasm and apolar residues point to the membrane 
(Supplementary Fig. 4), indicating that helix HL resides at the surface 
of the lipid bilayer. The periplasmic surface of the membrane domain 
is mostly negatively charged and the cytoplasmic surface is positively 
charged, in agreement with the positive-inside rule’’. Lipid bilayer 
boundaries are clearly visualized by the belts of solvent-exposed Tyr 
and Trp residues” and by the hydrophobicity of surface-exposed 
residues (Supplementary Fig. 4). 


Fold of antiporter-like subunits 

Subunits NuoL, M and N contain 14 structurally conserved helices 
(Fig. 2a-c, Supplementary Fig. 5 and Supplementary Table 3). The 
fold differs from the bioinformatic prediction*’. Sequential trans- 
membrane helices contact each other, except for TM8 and TM9, 


2 | NATURE | VOL 000 | 00 MONTH 2011 


and TM13 and TM14. The two discontinuous helices observed previ- 
ously"’ are TM7 and TM12. They are interrupted in the middle of the 
bilayer by an extended loop of 5-7 residues. The tips of these loops 
contain a proline that is conserved between all three antiporter-like 
subunits: M(Pro 239)* in TM7 and M(Pro 399)* in TM12 (the prefix 
indicates the subunit name; asterisks indicate residues conserved 
between all three antiporter-like subunits). As in other transporters””’, 
such helices are likely to participate in proton or ion transport by 
introducing some flexibility and charge to the middle of the membrane. 

TMs 1, 2 and 14 form a lipid-facing layer at the opposite side of the 
domain from helix HL and are the least conserved in sequence and in 
structure (Supplementary Figs 3 and 5). TMs 2 and 3 are connected at 
the periplasmic surface by an extended f-hairpin. The core of the 
subunits is formed by the well-conserved helices 4-13. Their arrange- 
ment has internal symmetry: TMs 4-8 can be superimposed on TMs 
9-13 by a rotation of about 180° along the axis lying in the membrane 
plane and directed along the long axis of the domain, followed by a 
shift along this axis (Fig. 2c-f and Supplementary Movie 2). Thus, the 
two halves of the core, each containing five TMs, are related, uniquely, 
by symmetry along a pseudo-two-fold screw axis. Symmetry-related 
sets of helices in transporters may span the membrane in a parallel or 
antiparallel fashion, but have always been found in face-to-face 
arrangements, with substrate binding sites located near the symmetry 
axis at the domain interface’. However, in case of complex I 
antiporter-like subunits, the interface is formed by face-to-back con- 
tact, presenting a novel arrangement of symmetry-related helices. 
This places discontinuous helices 7 and 12 far apart, so that they 
cannot interact directly, in contrast to the situation in other trans- 
porters. It is likely that antiporter-like subunits have evolved by gene 
duplication”, even though the sequences of the two inverted domains 
(Fig. 2c-e) have diverged (Supplementary Table 3). 

At the centre of each antiporter-like subunit, TM8 is partly 
unwound in the middle and contains a kink with no proline residues, 
similar to the m-bulge in bacteriorhodopsin”*. The secondary struc- 
ture is markedly disrupted near the kink (residues L(253-258), 
M(259-264) and N(242-247)), indicating flexibility. Such m-bulges 
(or m-helices**) are normally found at protein functional sites”, indi- 
cating a mechanistic importance for the kink in TM8. 

Two well-studied, essential charged residues that are conserved 
from Mrp antiporters to complex I (refs 15, 26) are found in the 
middle of TM5 (L(Glu 144), M(Glu 144) and N(Glu 133); termed 
Glu TM5 here), and in the middle of TM7 (L(Lys 229), M(Lys 234) 
and N(Lys 217); Lys TM7). Lys TM7 is the last residue of the periplas- 
mic half of discontinuous TM7 (TM7a), and its s-amino group points 
inside the putative proton channel (termed here the first channel), 
formed by TMs 5-8. Notably, Glu TM5 is wedged at the interface of 
TMs 5 and 6, and is exposed both to this channel and to the cavity at 
the interface with the adjacent subunit. The Glu TM5 carboxyl group 
can interact electrostatically with the Lys TM7 amino group, which is 
about 5-6A away. An invariant Trp from TM7b (M(Trp 243)*) 
stabilizes the link between the two halves of TM7. The negative 
helix-dipole charge here may be partly compensated by N(Trp 247), 
M(His 248) and L(His 254). 

The area near the second discontinuous helix, TM12, harbours 
many conserved polar residues in the middle of the membrane, so is 
also of clear functional importance. In NuoL and NuoN, an invariant, 
essential’””* lysine (L(Lys 399) and N(Lys 395)) lies at the beginning of 
the periplasmic half of TM12 (TM12b) in a position approximately 
symmetrical to Lys TM7 (Fig. 2d, e). It may interact with, and stabilize, 
the negatively charged C terminus of TM12a. In NuoM, a similar 
position is occupied by an invariant Glu 407. Here, the negative dipole 
moment may be stabilized by the invariant His 322. Notably, Glu 407 is 
replaced by an invariant lysine in the homologous MrpD subunit of 
antiporters. The side chain of this Lys or Glu (termed Lys/Glu TM12) is 
exposed in a large cavity between TMs 10-13. Thus, unexpectedly, a 
second putative proton channel is present here, related by inverted 
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Figure 2 | Fold of antiporter-like subunits. The fold is illustrated using 
subunit NuoM. a, Side view, cytoplasmic side up, centred on TM 12. b, Side view 
centred on TM7, coloured blue to red from N to C terminus. Essential charged 
residues are shown as sticks (Glu TM5 and Lys TM7 from channel 1, GluTM12 
from channel 2 and the connecting M(Lys 265) from TM8). Conserved prolines 
from intra-helical loops are also shown. c, View from the cytoplasm into the 
membrane. TM helices are numbered. Two inverted repeats are circled and the 


symmetry to the first channel. Additionally, N(Lys 395) is exposed to 
the cavity between NuoN and NuoM, and M(Glu 407) is partially 
exposed to the cavity between NuoM and NuolL, similarly to Glu TM5. 

Searches using either complete subunits or the repeating domain 
revealed no structural analogues in the protein data bank (PDB), 
confirming that the fold is novel. Homologous subunits in Mrp anti- 
porters, chloroplast Ndh complexes and membrane-bound hydroge- 
nases, not yet structurally characterized, are likely to share the same 
core fold. 


Fold of subunits NuoK, NuoJ and NuoA 


In contrast to NuoL, M and N, the smaller subunits interact exten- 
sively, with large surface areas buried at interfaces. They also form 
more hydrogen bonds and salt bridges to their neighbours (Sup- 
plementary Table 4), ‘gluing’ together this end of the membrane 
domain. 

NuoK spans the membrane with three linearly arranged a-helices, 
connected by short loops (Supplementary Fig. 6a). NuoK interacts 
extensively with NuoN (Fig. 1b), consistent with the presence of the 
analogous MrpC subunit in antiporters”. The C terminus of NuoK 
extends between helix HL and NuoN, forming many inter-subunit links. 
NuoK can be aligned with TMs 4-6 of antiporter-like subunits (and 
also with TMs 9-11, but with low sequence identities) (Supplementary 
Table 3). Notably, conserved K(Glu 36) aligns to Glu TM5, indicating 
that NuoK and the antiporter-like subunits might share an ancestor. 

Subunit NuoJ has an unusual non-globular fold: three linearly 
arranged amino-terminal TM helices border NuoK, and TMs 4 and 
5 are separated at the opposite sides of the domain (Supplementary 
Fig. 6b). Thus, NuoJ interweaves between NuoK, A and N, stabilizing 
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pseudo-two-fold symmetry screw axis that relates them is shown. d, Overlay of 
the two symmetry-related domains (TMs 9-13, in grey, over TMs 4-8). 

e, Topology diagram of antiporter-like subunits. Two inverted repeats of the 
conserved core are shown in green and magenta. Crucial charged residues 
(Glu TM5, Lys TM7 and Lys/Glu TM12) are indicated. f, Structural repeats in 
the membrane domain, seen from the cytoplasm. TMs 4-8 are highlighted in 
green and TMs 9-13, in magenta. 


the complex. TM3 of NuoJ contains in its middle a m-bulge/kink 
(residues 59-62), similar to TM8 in NuoL, M and N, so this helix is 
probably flexible and mechanistically important. 

Subunit NuoA consists of three linearly arranged TM helices at the 
edge of the domain, interacting with NuoJ, N and K (Supplementary 
Fig. 6c). Its outer surface, including the conserved A(Asp 79) in the 
middle of the membrane, probably interacts with NuoH (Fig. 1c). A 
conserved loop between TM1 and TM2 is not resolved in the structure 
and probably interacts with the hydrophilic subunit NuoD’’. The 
probable quinone-binding site is formed at the interface of subunits 
NuoH, J and A with the hydrophilic domain" (Fig. 1c). 


Interactions between subunits 


About half of the total subunit surface area is buried away from the 
solvent (Supplementary Table 4), consistent with the relative stability 
of the domain. However, interactions between antiporter-like subunits 
are not extensive. They mainly involve two helices from each subunit 
(TM5 and 6 and TM12 and 13), and are mostly hydrophobic, with very 
few hydrogen bonds (Supplementary Table 4). Crevices between sub- 
units are likely to be filled with lipids, some of which are ordered and 
modelled here. 

Helix HL stabilizes the complex. It is tightly anchored to NuoN, J 
and K by L(TM16), which also contacts N(TM7). HL contacts NuoM 
near M(TM7), where HL is unwound, allowing for strong interactions 
via the backbone and side chains (Fig. 3 and Supplementary Table 4). 
Ina similar fashion, helix HL forms hydrogen bonds to the main body 
of the domain near TM7 of NuoL and N. These contacts involve 
conserved M(Lys 173)*, M(Asp 246)* and M(Tyr 317)*, and a semi- 
conserved residue, M(His 241). 
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Figure 3 | Connecting elements and interactions between subunits. a, View 
from the periplasm. Helix HL is in purple and the B-hairpin-helix element BH 
(consisting of hairpins B-h and helices CH) is in blue. The contacting helix from 
NuoJ (preceding TM5 of NuoJ) is in cyan. Helices that are probably involved in 
conformational changes are shown in red (TM7 in NuoL, M and N), orange 
(TM12), yellow (TM8) and green (TM3 in NuoJ). Essential charged residues 
are shown as sticks and labelled (prefix indicates subunit). Also shown are 


Notably, an additional element connecting antiporter-like subunits 
is present on the opposite side of the domain from helix HL. The 
extended and well-ordered (thus probably rigid) {-hairpins 
from neighbouring subunits contact each other via a stretch of 
C-terminal amphipathic helices termed CH (Fig. 3). This notable 
B-hairpin-helix element (termed the BH element) extends over the 
entire length of the antiporter-like subunits and contributes to the 
stability of the complex, because the only polar interactions between 
NuoM and NuoN are via the tip of the NuoM -hairpin (there are also 
similar interactions between NuoM and NuoL; Supplementary Table 
4). Predictions of secondary structure indicate that there are B-strands 
of similar length between TMs 2 and 3 in most other species, so the 
B-hairpin and second connecting element are likely to be common 
features. 


Proton translocation pathways in subunits NuoL, M and N 
The central positions of Lys TM7 and Lys TM12 in both channels, close 
to the breaks in TM7 and TM12, and the severe effects of their muta- 
tions (Supplementary Table 6), indicate that in complex I, unusually, 
lysine rather than carboxylate residues have a central role in the proton 
pump (one exception is M(Glu 407)). Examination of potential path- 
ways (Supplementary Discussion, section 1) shows that none of the 
channels is solvent-accessible on both sides of the membrane. 

In the first channel, the cavity surrounding Lys TM7 is about 70 A® 
and is lined by 2-5 polar residues (Fig. 4a—c and Supplementary Table 5). 
It is closed from the periplasm by large hydrophobic residues but is 
open to the cytoplasm via a network of polar residues. Cavities between 
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conserved residues involved in contacts between antiporter-like subunits and 
helix HL, as well as between the B-hairpins and TM8. Putative hydrogen bonds 
are indicated. Potential interactions between Glu TM5 and a conserved proline 
from the TM12 loop are indicated by arrows. b, View in the membrane, with 
TM14 also shown (light blue). Tryptophan residues forming hydrogen bonds to 
the backbone at the C terminus of TM14 are indicated. 


subunits L and M, and M and N, are blocked from the periplasm but 
may be accessible from the cytoplasm, possibly allowing an additional 
(but less likely; Supplementary Discussion, section 1) protonation 
route for Lys TM7, by ‘side-entry’ near Glu TMS. 

The cavity in the second channel, near Lys/Glu TM12, is markedly 
larger and more hydrophilic than that in the first one (up to 200 Ae, 
~10 polar residues; Fig. 4a-c and Supplementary Table 5). Access to 
the cytoplasm is closed but the periplasm is accessible via short polar 
networks. 

Many conserved charged and polar residues in the middle of the 
membrane connect the two channels in each subunit (Fig. 4, Sup- 
plementary Fig. 8 and Supplementary Table 5). Notably, in NuoM and 
NuoN, central positions in this connection are occupied by invariant 
essential residues***°*' N(Lys 247) and M(Lys 265), found near the 
TM8 z-bulge. NuoL also contains an invariant residue, His 254, near 
this position. 

Thus, in all antiporter-like subunits, unusually, two half-closed 
symmetry-related channels are linked, forming a single continuous 
proton translocation pathway through each subunit. Consistently, all 
the observed water molecules are either in the channels or in the 
connection (Fig. 4). Other pathways seem to be unlikely (Supplemen- 
tary Discussion, section 2). 


The fourth proton translocation pathway 

An additional channel is apparent at the interface of subunits NuoN, K, J 
and A. A cavity of ~130 A’ between NuoN and NuoK can accommodate 
water molecules coordinated by K(Glu72), N(Thr 160) and N(Ser 156) 
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Figure 4 | Proton translocation channels. a—c, Putative channels in the 
antiporter-like subunits. Polar residues lining the channels are shown as sticks, 
with carbon in dark blue for the first channel, in green for the second channel 
and in orange for connecting residues. d, Channel at the interface of subunits 
NuoN, K and J, with polar residues in cyan. All views are in a similar orientation 
to Fig. la. Approximate proton translocation paths are indicated by blue 


(Fig. 4d). Space for the cavity is provided by an invariant residue, 
K(Gly 76). Access to the periplasm is blocked by large hydrophobic 
residues but the cytoplasm is accessible with only a short constriction 
near K(Leu 83). Indeed, the structure shows a water molecule in the 
middle of the bilayer, between the conserved, essential residues*”** 
K(Glu 36) and K(Glu72). From K(Glu 36), the continuation of the 
pathway is apparent at the interface of TMs 2 and 3 from NuoK and 
NuoJ, reaching the periplasm. Several small cavities here may contain 
water molecules coordinated by conserved polar residues (Fig. 4d). An 
alternative ‘sideways’ pathway from K(Glu36) to the periplasm 
through invariant A(Glu 81) to A(Glu 102) can be considered, but is 
less likely, requiring a more extended network of water molecules. 


Mechanism of proton translocation 


There is no evidence in the structure for any potential electron carriers 
in the membrane domain, for the existence of quinone-binding sites 
in antiporter-like subunits, or for a different role of NuoN (Sup- 
plementary Discussion, section 4), as has been discussed in the 
literature**~°. Structural features combine to indicate strongly that 
complex I operates purely by a conformation-driven mechanism. 

Conformational changes in the hydrophilic domain, which occur 
upon reduction’’, can be transmitted to the membrane-domain sub- 
units NuoH, A, J and K (Supplementary Discussion, section 3). Helix 
HL can relay these changes to TM7 in NuoL, M and N, leading to 
protonation of Lys TM7 upon closer approach of Glu TMS. Recent 
studies have confirmed the essential coupling role of this helix’”** 
(Supplementary Discussion, section 4). 


arrows. Additional, less likely, input channels at the interfaces of subunits are 
paler in colour and labelled “’. Essential charged residues are labelled in red. 
Water molecules resolved in the structure are shown as red spheres. Alanine 
and glycine from the m-bulge in TM8 may coordinate water molecules via 
exposed backbone carbonyls. 


The structure indicates that conformational changes in both half- 
channels of the antiporter-like subunits are coordinated not only by 
helix HL, but by several connecting elements, acting in concert to 
achieve tight coupling of proton translocation events. HL can drive 
TM7 in the first channel, and, on the opposite side of the domain, the 
BH element can drive TM12 in the second channel. The two channels 
can interact both within subunits (via TM8, which contains an invariant 
connecting Lys or His near its flexible kink) and also between subunits, 
because the intra-helical loop of TM12 is in close contact with Glu TM5 
on the neighbouring subunit. Furthermore, Glu TM5 can also interact 
electrostatically with Lys/GluTM12 from a neighbouring subunit, 
because both residues are exposed to the inter-subunit cavities. 

We suggest the following translocation cycle via antiporter-like 
subunits (Fig. 5), assuming that LysTM7 is protonated and that 
Lys/GluTM12 is deprotonated in the oxidized state (meaning the 
state of iron-sulphur cluster N2 and a nearby cluster’’, because most 
clusters are reduced during turnover*’). Conformational changes 
upon reduction move GluTM5 away from Lys TM7, forcing lysine 
to donate its proton into the link between the two half-channels and 
eventually to Lys/GluTM12, which becomes protonated owing to 
local conformational changes, either with or without help from 
Glu TMS from the neighbouring subunit. Upon return to the oxidised- 
state conformation, Glu TM5 moves back, Lys TM7 is protonated from 
the cytoplasm, reloading the pump, and Lys/Glu TM12 ejects its proton 
into the periplasm. Overall, three subunits translocate three protons. 

In the fourth putative channel, a notable feature indicating con- 
formational coupling is the m-bulge on TM3 of NuoJ, which contains a 
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Figure 5 | Proposed mechanism of complex I. a, Oxidised state. b, Reduced 
state. Conformational coupling between electron transfer and proton 
translocation is mediated by helix HL (cytoplasmic side) and the BH element 
(periplasmic side). TM helices are numbered. Crucial charged residues 

(Glu TM5, Lys TM7, Lys/Glu TM12 and Lys/His TM8 from NuoL, M and N, as 
well as K(Glu 72) and K(Glu 36), interacting with J(Tyr 59)) are indicated by 
red or empty circles (Glu) and empty or blue circles (Lys) for unprotonated or 
protonated residues, respectively. In NuoL, M and N, Lys TM7 from the first 
half-channel is assumed to be protonated in the oxidized state. Upon reduction, 
it donates its proton to the connecting Lys/His TM8 and then on to Lys/ 

Glu TM12 from the second half-channel. Lys/GluTM12 ejects its proton into 
periplasm upon return from the reduced to the oxidized state. A fourth proton 
per cycle is translocated at the interface of NuoN, K and J. 


residue that is invariant and important for activity*”’, J(Tyr 59), 
forming a hydrogen bond to K(Glu 36), a central residue of this channel. 
TM3 is the most conserved helix in NuoJ (Supplementary Figs 3a and 
8) and is a hotspot for human mitochondrial disease mutations 
(Supplementary Table 7). Conformational changes, propagating from 
the nearby hydrophilic domain, can move the flexible TM3 of NuoJ 
and J(Tyr 59), resulting in the protonation/deprotonation of K(Glu 36) 
and the translocation of the fourth proton (Fig. 5). Because the four 
channels that are present in the structure account for the measured 
H*/e~ stoichiometry”, any additional direct coupling, as proposed 
recently*™, is unlikely. 

Our knowledge of the structure of the core subunits of complex I is 
now nearly complete (except for subunit NuoH). The effects of many 
mutations, including those causing human diseases, can now be 
understood (Supplementary Tables 6-8). This molecular machine 
has one of the most intricate architectures among known protein 
structures. The redox energy of NADH, binding at the tip of the 
hydrophilic domain, is eventually used in the membrane domain, at 
distances of up to ~300 A away. Our previously published analogy of 
the ‘steam engine of the cell’ is emphasized by the presence of several 
connecting elements, including helix HL, which are powered by the 
redox energy of electron transfer and which drive six symmetrical 
structural domains, plus one asymmetrical domain, resulting in a very 
effective proton pump. 
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METHODS SUMMARY 


The membrane domain of E.coli complex I was purified as previously described", 
except that the detergent cymal-7 was used. Protein was crystallized in the 
presence of lipids using polyethylene glycol (PEG) 4000 and sodium tartrate as 
the precipitant at pH 4.8, with the additives 0.15% sodium cholate and 0.1M 
NDSB-256 (non detergent sulfobetaine-256). Diffraction was improved by a slow 
increase of PEG 4000 concentration to 30%, by dialysis at 4 °C. 

Diffraction data were collected from cryo-cooled crystals (100 K) on beamline 
ID29 at the European Synchrotron Radiation Facility (ESRF) and X06SA at the Swiss 
Light Source (SLS). Crystallographic phases were obtained using selenomethionine- 
labelled protein. Electron density was improved by multi-crystal averaging and 
density modification. The folds of individual subunits and sequence register at the 
initial steps of model building were determined using the positions of anomalous 
density peaks from Se atoms, directionality of helices and density connectivity. A 
combination of manual and automatic model building was used. Data collection and 
refinement statistics are presented in Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Protein purification, crystallization and post-crystallization treatment. Complex 
I from E. coli was purified from strain BL21(DE3) as previously described’. The 
membrane and hydrophilic domains were separated" by treatment with a high 
concentration of Mg**: purified complex I (2.5mgml~') was incubated in 
20mM Bis-Tris (pH 6.0), 5% (v/v) glycerol, 400mM MgCl, 12.5mM NaCl, 
0.5mM CaCl, and 0.25% (w/v) n-dodecyl-B-maltoside (Glycon) for 2h on ice. 
The protein solution was diluted tenfold with buffer A (20 mM Bis-Tris (pH 6.0), 
0.02% cymal-7 (Anatrace)), loaded onto a Mono-S HR 5/5 column and eluted with a 
25 ml gradient, from 10% to 70%, of buffer B (1M NaCl in buffer A). Fractions 
containing the membrane arm were pooled, concentrated in 100-kDa MWCO 
concentrators (Ultracel-100K, Amicon), diluted tenfold with 10 mM sodium acetate 
(pH 4.8), 50 mM NaCl and 0.02% cymal-7, and concentrated to 10-11 mg mit. 

Before crystallization, lipids (1,2-dimyristoleoyl-sn-glycerol-3-phosphocholine 
with E. coli polar lipids at a 3:1 ratio (w/w), Avanti Polar Lipids) were added to 
the protein to 2 mg ml ' final concentration. Crystals were grown by the sitting drop 
method, mixing protein at 1:1 (v/v) ratio with 0.1 M sodium acetate (pH 4.8), 0.3 M 
sodium tartrate (pH 5.0), 9% (w/v) PEG 4000, 0.15% sodium cholate and either 
0.1M NDSB-256 or 0.4% nonyl-glucoside from Detergent Screen HT (Hampton 
Research). Crystals grew at 22 °C as thin plates of up to 200 um in length and 20- 
40 um in thickness. They were harvested after 3 weeks to 4 months, and initially 
diffracted to 7 A resolution. To improve diffraction properties, crystals were slowly 
cooled to 4 °C, placed into micro-dialysis buttons, and mother liquor was exchanged 
by dialysis for 0.1 M sodium acetate (pH 4.8), 0.15 M sodium tartrate (pH 5.0), 30% 
PEG 4000, 0.1% sodium cholate and 12.5% glycerol over a period of 5 days. 

For selenomethionine labelling, a feedback inhibition method** was applied 
during cell growth on M9-derived minimal media with 0.6% malate (optimized 
for maximal yield of complex I) as a carbon source. Purification and crystalliza- 
tion conditions were identical to those for native protein except for the addition of 
10 mM tris(2-carboxyethyl)phosphine (TCEP, Thermo Scientific) to crystalliza- 
tion solutions. 

Because the crystals were obtained at low pH and it is known that E. coli complex 

I dissociates into hydrophilic and hydrophobic domains after prolonged incubation 
at pH outside the 5.5-6.5 range”, we assayed the purified enzyme for NADH:DQ 
activity at pH 4.8. The activity is lower than at optimal pH”, but is still substantial 
(~4 mol NADH oxidised per min per mg protein). It is still fully inhibited by 
piericidin A, indicating that enzyme is functional and that there are no drastic 
effects of pH on the conformation of the entire complex or individual domains. 
Data collection and processing. Data were collected at 100K with an ADSC 
Q210 detector at beamline ID29 at the European Synchrotron Radiation Facility 
anda Marmosaic 225 detector on the high-resolution diffractometer of beamline 
X06SA at the Swiss Light Source. Image data were processed with MOSFLM and 
SCALA from the CCP4 suite®’. 
Model building and refinement. Crystals grew in space group P1 with two mol- 
ecules in the unit cell and 70% solvent content. As is the case with orthorhombic 
crystals’, triclinic crystals were non-isomorphous and diffracted anisotropically, 
with the worst diffraction along the a axis and comparable resolution along axes b 
and c; the c axis being systematically slightly better. Using the Diffraction 
Anisotropy Server (http://services.mbi.ucla.edu/anisoscale/)”’, the best native data 
set was anisotropically scaled and truncated to 3.4 A, 3.0 A and 3.0A resolution, 
where the F/a ratio drops to ~2.6-2.8 along the a*, b* and c* axes, respectively 
(scaling 2, Supplementary Table 1), resulting in better-quality electron density 
maps. As a control, the final model was also refined against non-truncated raw 
data at 3.2 A resolution, with excellent statistics (scaling 1, Supplementary Table 1). 
Crystal packing resembles that observed previously in space group P2,22), with 
strong interactions between hydrophilic surfaces and no visible side contacts, 
explaining the weaker diffraction along the a axis (Supplementary Fig. 2). It is 
possible that side contacts are mediated by poorly ordered lipids. 

Several anomalous diffraction data collected from selenomethionine-containing 
crystals were used for phasing (Supplementary Table 1). Despite challenges pre- 
sented by anomalous data collection in the P1 space group, several good quality 
MAD data sets were obtained using a low-intensity beam and, in some cases, 
inverse-beam geometry. Initially, the positions of more than 140 Se atoms in the 
unit cell were determined and refined automatically in PHASER”, using molecular 
replacement (MR) phases from the backbone model of the membrane fragment of 
E. coli complex I (PDB code 3M9C, ref. 11). The correspondence of the Se positions 
with peaks on the anomalous difference map was verified manually. When such 
maps were calculated with phases improved by density modification, the positions 
of 80 Se atoms per molecule were revealed. Experimental MAD phases calculated 
with these Se atoms in SHARP” gave a figure of merit of 0.23 in the resolution range 
between 20 A and 3.5 A, with the anomalous phasing power from peak, inflection 
and remote data sets being 0.608, 0.368 and 0.195, respectively. Isomorphous 
phasing power for inflection and remote data sets was 0.158 and 0.198, respectively. 


Phases were improved and extended by multi-crystal averaging with histogram 
matching and solvent flattening in the CCP4-suite program DMmulti. The 
molecular masks and transformation matrices were obtained from MR solutions. 
Four data sets were used: two from two P1 crystals (one native and one with Se 
MAD phases, Supplementary Table 1), and two phased data sets in the P2212; 
space group from our previous study (Supplementary Table 1 in ref. 11, crystals 
Nativel with TaBrl MAD phases and Native3 with TaBr3 SAD phases). 
Additionally, a solvent-flipped map was calculated in CNS” using MAD data from 
a single SeMet-containing crystal. The maps were of excellent quality; they had 
better connectivity in DMmulti maps and, occasionally, better side-chain density 
in CNS maps, so both maps were used during model building. 

Individual subunit folds and sequence register at the initial steps of model 

building were deduced on the basis of Se atom positions, directionality of helices 
determined using the ‘find_helices_strands’ procedure in PHENIX®’, and local 
density averaging in O (ref. 54), as well as density connectivity. An initial model 
was built by combining manual building in O and Coot v.0.6.1 (ref. 55) with 
automatic model building in Buccaneer v.1.5 (ref. 56). The model was rebuilt 
manually during cycles of refinement. The structure was refined in PHENIX. A 
subset of 2% of structure factors was omitted from refinement and used for cross- 
validation (calculation of R¢ee). Simulated annealing was used in the early stages; 
secondary structure and non-crystallographic symmetry restraints were used 
throughout the refinement and TLS (translation, libration and screw-rotation 
displacements parametrization) was used during the final refinement. For initial 
refinement, Se MAD experimental or blurred phases from multi-crystal aver- 
aging were used, whereas final refinement was only against native data. Several 
well ordered internal water molecules were modelled on the basis of their well 
defined electron density, presence of coordinating residues and low B-factor upon 
refinement. The final model includes 1,952 residues, one cymal-7 molecule, eight 
water molecules and nine fragments of lipid aliphatic chains. The structure was 
validated in PROCHEK” and MOLPROBITY* and was found to be of better- 
than-average quality for the resolution. As calculated with PROCHEK, 87.0/12.3/ 
0.3/0.0% of residues are in preferable/allowed/generously allowed/disallowed 
Ramachandran regions, respectively. 
Bioinformatics. Searches for structural analogs in the PDB were performed using 
the SSM server (www.ebi.ac.uk/msd-srv/ssm/cgi-bin/ssmserver). In the case of 
NuoK and NuoA, as expected for small transmembrane proteins, similar folds 
could be found in the PDB, but without any significant sequence similarity. 
Structure-based multiple sequence alignments were performed in CLUSTALW 
v1.83 (ref. 59) with the profile alignment option. Homology models of the 14 
conserved TM helices of Mrp subunits A and D from Bacillus subtilis and Bacillus 
pseudofirmus were built with MODELLER 9v7 (ref. 60) using structures of the E. 
coli subunits NuoL and NuoM, respectively, as templates. Sequences for model- 
ling were aligned with CLUSTALW, using 30 sequences for NuoL or NuoM in 
addition to Mrp sequences and the structure-based profile alignment option. 
Figures were prepared in PyMol. 
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self-induced meiotic instability 
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DNA double strand breaks (DSBs) in repetitive sequences are a 
potent source of genomic instability, owing to the possibility of 
non-allelic homologous recombination (NAHR). Repetitive 
sequences are especially at risk during meiosis, when numerous 
programmed DSBs are introduced into the genome to initiate meiotic 
recombination’. In the repetitive ribosomal DNA (rDNA) array of 
the budding yeast Saccharomyces cerevisiae, meiotic DSB formation 
is prevented in part through Sir2-dependent heterochromatin 
formation’*. Here we show that the edges of the rDNA array are 
exceptionally susceptible to meiotic DSBs, revealing an inherent 
heterogeneity in the rDNA array. We find that this localized DSB 
susceptibility necessitates a border-specific protection system con- 
sisting of the meiotic ATPase Pch2 and the origin recognition com- 
plex subunit Orcl. Upon disruption of these factors, DSB formation 
and recombination increased specifically in the outermost rDNA 
repeats, leading to NAHR and rDNA instability. Notably, the 
Sir2-dependent heterochromatin of the rDNA itself was responsible 
for the induction of DSBs at the rDNA borders in pch2A cells. Thus, 
although the activity of Sir2 globally prevents meiotic DSBs in the 
rDNA, it creates a highly permissive environment for DSB forma- 
tion at the junctions between heterochromatin and euchromatin. 
Heterochromatinized repetitive DNA arrays are abundant in most 
eukaryotic genomes. Our data define the borders of such chromatin 
domains as distinct high-risk regions for meiotic NAHR, the pro- 
tection of which may be a universal requirement to prevent meiotic 
genome rearrangements that are associated with genomic diseases 
and birth defects. 

To understand better the mechanisms that protect repetitive DNA 
from meiotic NAHR, we analysed the single tandem rDNA array of 
budding yeast. Meiotic DSB formation and recombination in the rDNA 
are repressed by the histone deacetylase Sir2 (refs 2, 3). Additionally, 
Pch2, a widely conserved meiosis-specific ATPase, suppresses meiotic 
recombination in the rDNA by an unknown mechanism**. We used 
clamped-homogenous electric field (CHEF) electrophoresis and 
Southern blotting of excised rDNA arrays to address whether Pch2 
regulates meiotic DSB formation in the rDNA. Consistent with pre- 
vious results~*, the level of full-length rDNA arrays that remained 8h 
after meiotic induction was significantly reduced in sir2A mutants 
compared to wild-type cells, indicating increased DSB formation 
(Fig. la and Supplementary Fig. 1a). By contrast, no such reduction 
occurred in pch2A mutants, although we observed a tenfold increase in 
crossover recombination across the rDNA array (Fig. la, b). Because 
small changes in array length would not be detectable by the CHEF gel 
assay, we wondered whether DSB formation in pch2A mutants 
occurred specifically in the outermost rDNA repeats. To test this 
possibility, we generated pch2A strains carrying a URA3 insertion at 
defined positions in the rDNA array (Fig. 1c) and analysed the rDNA 
repeat units that directly flank these insertions by Southern blotting. 
We observed a strongly DSB-prone site in repeat 1 and weak DSB 


formation in repeat 3, whereas no DSB formation was detectable in 
repeat 10 of the approximately 100 rDNA repeats (Fig. 1d). Thus, 
pch2A cells undergo increased meiotic DSB formation predominantly 
in the outermost rDNA repeats. 

To determine whether PCH2 suppresses DSB formation only within 
the rDNA, or also in other regions of the genome, we first analysed a 
chromosomal fragment spanning the junction between single-copy 
DNA and rDNA ina pch2A mutant by Southern blotting. We observed 
additional, strong DSB formation in the adjacent single-copy sequences 
(Fig. le and Supplementary Fig. 1b), which were previously shown to 
have exceptionally low levels of meiotic DSBs in PCH2 cells” (Fig. 1f). 
The observed break sites behaved similarly to known meiotic DSBs’; 
they were induced during meiosis in dmc1A and DMC1 cells (Fig. 1d, e 
and Supplementary Fig. 1c), depended on the meiotic DSB machinery 
(Supplementary Fig. 1d)’, promoted meiotic recombination (Sup- 
plementary Fig. le) and occurred in gene promoters (Fig. le and 
Supplementary Fig. 1b). Indeed, even the DSBs observed in repeat 1 
mapped to the promoter of a gene (TARI) that is encoded in every 
rDNA repeat’? (Fig. le). Genome-wide analysis of DSBs° in pch2A cells 
showed that strong DSB induction occurred in 30-50-kilobase (kb) 
regions of single-copy sequence abutting both sides of the rDNA 
(Fig. 1f). Increased DSB formation was also observed close to other 
heterochromatic regions (telomeres and HML), whereas the DSB land- 
scape elsewhere in the genome was not markedly altered (Supplemen- 
tary Figs 1f, g, 2 and Supplementary Table 1). In contrast to pch2A 
mutants, the loss of SIR2 did not lead to increased DSB formation 
adjacent to the rDNA array (Fig. 1f). Thus, Pch2 represses recombina- 
tion in the rDNA at the level of DSB formation, but in a manner distinct 
from Sir2. 

We investigated whether the increased DSB formation in the 
outermost rDNA repeats in pch2A mutants (Fig. 1d) resulted in a local 
increase in rDNA recombination. We measured recombination rates 
using flanking markers to the left and right of the rDNA together with 
a collection of single URA3 insertions tiling inwards from the left side 
of the rDNA (Fig. 1c). Analysis of a URA3 insertion in the centre of the 
rDNA (inserted next to repeat 49 of 99) indicated that recombination 
occurred in a symmetrical pattern. Notably, about 80% of the recom- 
bination events in the left half of the rDNA occurred within the first ten 
repeats from the left border (Fig. 1g and Supplementary Table 2), with 
about 30% taking place within repeat 1. Thus, there is a strong bias for 
recombination in the rDNA repeats very close to the array border. 

Because recombination in repetitive DNA can lead to NAHR, we 
selected tetrads of pch2A mutants that had undergone recombination 
in the rDNA, and determined the resulting rDNA repeat number 
between the URA3 insertion and the left rDNA boundary. In 70% 
(n = 47) of the tetrads investigated from different URA3 integrants, 
we detected changes in repeat number, ranging from 1 to 19 repeats 
(Fig. li, j and Supplementary Table 2), demonstrating that rDNA 
crossovers in pch2A cells are frequently associated with NAHR. 
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Figure 1 | Ribosomal-DNA-associated DSB formation and recombination. 
a, CHEF analysis of the rDNA of meiotic dmc1A (H5217), sir2A dmc1A 
(H2953) and pch2A dmc1A (H5216) cells. The schematic shows the analysed 
Xhol restriction fragment. A dmc1A mutation was used to prevent DSB repair. 
The mean (+ s.e.m.) of five experiments is shown. Significance was assessed by 
one-tailed Student’s t-test: dmc1A versus sir2A dmc1A, P value = 0.00122; 
dmc1A versus pch2A dmcl1A, P value = 0.254; pch2A dmc1A versus 

sir2A dmciA, P value = 0.00216. b, Schematic of markers inserted in unique 
single-copy sequences within 500 base pairs (bp) of the rDNA, and crossover 
rates in wild-type (WT; H3026; n = 467) and pch2A (H3027; n = 186) cells. 

c, Schematic indicating marker locations in the rDNA used in d and g. URA3 
markers were inserted in the NTS 1/2 region of the indicated repeats. d, Southern 
blot for restriction fragments containing the indicated insert-associated rDNA 
repeat units from pch2A dmc1A strains H5622 (repeat 1), H5636 (repeat 3) and 
H5706 (repeat 10). DNA was digested with PflMI and Sall and probed for the 
unique rDNA insertion. The YCR047C probe (on DNA digested with HindIII) 
was a positive control for DSB formation. e, Southern blot of the left rDNA flank, 
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including the outermost rDNA repeat, in dmc1A (H5583) and pch2A dmc1A 
(H5622) cells (SalI and Nrul digest; probe YLR152C). Positions of open reading 
frames are shown schematically alongside the Southern blot. f, Enrichment 
profile of ssDNA in chromosome XII in dmc1A (H118, light grey), pch2A dmc1A 
(H2629, black) and sir2A dmc1A (H2953, dark grey) cells. Arrowheads indicate 
>twofold increased DSB formation in pch2A dmc1A compared to dmc1A cells. 
g, Tetrad analysis of pch2A URA3-rDNA-insertion strains H4611 (repeat 1), 
H4613 (repeat 3), H3823 (repeat 10), H4612 (repeat 12), H3820 (repeat 29) and 
H3821 (repeat 49; Supplementary Table 2). Recombination rates between URA3 
and rDNA-flanking markers are shown in relation to the physical URA3 
positions in the rDNA. h, Relative contribution of each measured interval 
indicated in g to total rDNA recombination (percentage of crossovers (CO) per 
kb of interval). i, Incidence of changes in rDNA repeat number between the 
URA3 insertion and the rDNA boundary in pch2A tetrads that had undergone 
crossover recombination. j, CHEF analysis of two tetrads that have undergone 
unequal recombination with parental controls. DNA was digested with XhoI 
and probed with NTS1. Par., parental controls; Rec., recombinants. 
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However, the prevalence of allelic recombinants (30%) and the fact 
that changes in repeat number encompassed less than 20% of the 
approximately 100-110 rDNA repeats in our strains indicate that 
the homology search for DSB repair in the outermost rDNA repeats 
in pch2A cells is restricted to close neighbours. Notably, the distri- 
bution of changes in repeat number closely matched the pattern of 
crossover events (Fig. 1h, i). This congruence indicates that the spread 
in the crossover distribution (Fig. 1g, h) can largely be accounted for by 
non-allelic exchanges between rDNA repeats originating from DSBs in 
the outermost repeats of the array. Finally, although rDNA exchanges 
occurred at a much lower frequency in wild-type cells, they were also 
associated with NAHR (Supplementary Table 2), indicating that Pch2 
primarily acts to prevent NAHR by suppressing DSB formation. These 
results establish the rDNA borders as high-risk regions for meiotic 
NAHR. 

To determine how Pch2 suppresses DSB formation near the rDNA, 
we measured the chromosome association of DSB-related factors at 
the time of DSB formation. The three essential DSB factors’*”” that we 
were able to analyse by chromatin immunoprecipitation (ChIP), 
Recl14, Mer2 and Mrel11, were specifically enriched near the rDNA 
and HML in pch2A cells, mirroring the changes in DSB formation 
(Fig. 2 and Supplementary Fig. 3). We then investigated whether the 
regional exclusion of DSB factors could be explained by local depletion 
of the DSB-promoting chromosome-axis protein Hop], the cytological 
distribution of which is affected by Pch2 (refs 4, 13). Although Hop1 
binding was slightly increased near the rDNA and HML in pch2A cells, 
it was abundant even in wild-type cells, indicating that Pch2 does not 
regulate the initial chromosomal recruitment of Hop]. Rather, the 
differences in Hop1 binding that we observed might reflect an effect 
of Pch2 on chromosome structure’’. Finally, although DSBs are 
enriched in promoters containing histone H3 lysine 4 trimethylation 
(H3K4me3)", we saw no difference in the genome-wide levels of this 
modification with or without Pch2 (Fig. 2c and Supplementary Figs 3d 
and 4), indicating that Pch2 does not influence this chromatin modi- 
fication. These findings indicate that Pch2 specifically blocks the stable 
recruitment of DSB factors to prevent local DSB formation. 

We sought to identify proteins that collaborate with Pch2 in prevent- 
ing rDNA-proximal DSBs. A yeast two-hybrid screen isolated a fragment 
of the Orcl protein, containing its ATPase domain, as a Pch2 interactor 
(Fig. 3a). This interaction was confirmed by co-immunoprecipitation 
(Fig. 3b and Supplementary Fig. 5a). Orcl is a component of the con- 
served origin recognition complex that has several important chromo- 
somal functions, including the loading of the replicative helicase’. 
Impairing Orcl protein levels by a temperature-sensitive orc1-161 muta- 
tion'® (Supplementary Fig. 5b) triggered DSB formation in the rDNA 
flanking regions, similarly to the loss of PCH2 (Fig. 3c, d). DSB formation 
near the rDNA occurred even at a temperature (23 °C) that is permissive 
for pre-meiotic DNA replication and spore viability (Fig. 3c-e and 
Supplementary Fig. 5c, d). Similarly, we saw increased DSB levels near 
the rDNA in an orcl mutant lacking the N-terminal bromo-adjacent 
homology (BAH) domain that is required for the chromatin-silencing 
function of Orcl (ref. 17), but is dispensable for DNA replication (Fig. 3f 
and Supplementary Fig. 5e, f). These data indicate that the regulatory 
roles of Orcl in DSB formation and bulk DNA replication are separable, 
although we cannot rule out that the analysed orc] mutations affect 
rDNA replication locally. During meiosis, Pch2 concentrates in the 
nucleolus, the organelle assembled on the rDNA array*. In orc1-161 
cells, the recruitment of Pch2 to the nucleolus was impaired, despite 
normal levels of cellular Pch2 (Fig. 3g, h). Both Pch2 and Orcl belong 
to the AAA™ family of ATPases that often function as multimeric 
complexes'’, and we found that the ATPase activity of Pch2 was 
required to prevent rDNA-proximal DSBs (Supplementary Fig. 5g). 
These data define a role for Orcl in the nucleolar recruitment and 
possible activation of Pch2 to prevent local DSB formation. 

To find out whether the specific DSB activity at the rDNA borders in 
pch2A mutants was linked to the presence of the rDNA itself, we 
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Figure 2 | Association of the meiotic DSB machinery near the rDNA. 

a, Enrichment of ssDNA in a region flanking the right border of the rDNA on 
chromosome XII (Saccharomyces genome database (SGD) coordinates) in 
dmc1A (H118, grey) and pch2A dmc1A (H26239, black) cells. Arrowheads indicate 
>twofold increased DSB formation in pch2A dmc1A compared to dmciA cells. 
b, ChIP-chip analysis in same region as in a, for the proteins Rec114—13Myc (first 
panel; wild type (H4890), pch2A (H4893)), Mer2-5Myc (second panel; wild type 
(H5917), pch2A (H5916)), Mrel1-13Myc (third panel; wild type (H5547), pch2A 
(H5947)) and Hop] (fourth panel; wild type (H119), pch2A (H2817)), in wild- 
type (grey) and pch2A (black) cells. c, Average enrichment for the different PCH2 
and pch2A ssDNA and ChIP data sets (see a, b and Supplementary Fig. 4) within 
the 50 kb flanking the right rDNA border (see Methods for genomic coordinates). 
The genome-wide average is indicated by the dotted line. 


deleted the rDNA array from its genomic location. In these strains, 
loss of PCH2 no longer allowed DSB formation in the flanking regions 
(Figs 4a, b), demonstrating an intrinsic DSB-promoting activity in the 
rDNA. To investigate whether the rDNA was sufficient to promote 
DSB formation, we created a translocation between chromosomes II 
and XII that exchanged the rDNA-distal portion of chromosome XII 
with a portion of chromosome II (Fig. 4c). In these strains, DSB levels 
were no longer increased in the former right flank of the rDNA (now 
flanked by chromosome II sequences; Fig. 4d), but notably, increased 
DSB formation was observed on the chromosome II sequences that, 
after translocation, were flanking the rDNA (Fig. 4e). Thus, in the 
absence of PCH2, the rDNA is necessary and sufficient to promote 
DSB formation. 
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Figure 3 | Orcl and Pch2 collaborate to suppress DSB formation. 

a, Schematic of Pch2 and Orcl proteins, indicating Orcl clones identified by 
yeast two-hybrid screen. b, Co-immunoprecipitation (IP) between 
haemagglutinin-tagged Pch2 (3HA-Pch2) and Orcl in wild-type (H119) and 
3HA-PCH2 (H3463) cells. Fpr3 and histone H3 are controls for the nucleolar 
and chromosomal fractions, respectively. c, Southern blots of the right rDNA 
flank and YCRO47C in meiotic dmc1A (H118) and orcl-161 dmc1A (H4952) 
cells, grown at the indicated temperatures. Arrowheads indicate broken DNA 
fragments. d, Profiles of ssDNA in the region flanking the right rDNA border in 
orcl-161 dmc1A (H5137, black) and dmc1A (H118, grey) cells, grown at 23 °C. 
e, DNA-content analysis of meiotic dmc1A (H118) and orcl-161 dmc1A 
(H5137) cells grown at 23 °C. 2C and 4C refer to unreplicated and replicated 
diploid DNA contents, respectively. f, Southern blots of the right rDNA flank 
and YCR047C in ORCI dmc1A (H5838) and orcl-Abah dmc1A (H5865) cells. 
g, Immunofluorescence of chromosome spreads stained for Pch2 (HA, green), 
Nop! (nucleolar marker, red) and DNA (blue) in 3HA-PCH2 (H3463) and 
3HA-PCH2 orc1-161 (H5033) cells at 3 h after meiotic induction at 23 °C. Scale 
bar, 2 um. h, Western blot analysis showing Pch2 (HA) expression in 3HA- 
PCH2 (H3463) and 3HA-PCH2 orc1-161 (H5033) cells. Pgk1 is used as loading 
control. 


The rDNA is assembled into specialized, Sir2-dependent chro- 
matin, and we asked how this chromatin state influenced DSB forma- 
tion at the rDNA boundaries. Notably, loss of Sir2 protein or loss of 
its deacetylase activity’’ largely eliminated DSB formation in the 
rDNA flanking regions in pch2A mutants (Fig. 4f, g). Thus, although 
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Figure 4 | Ribosomal-DNA chromatin promotes DSB formation. 

a, Schematic of the rDNA deletion strain (rdnAA) and ssDNA profiles of the 
region flanking the right rDNA border in pch2A dmc1A (H2629, black) and 
pch2A rdnAA dmclA (H4737, grey) cells. b, Southern blot analysis of the right 
rDNA flank in dmc1A (H118), pch2A dmc1A (H2629), rdnAA dmc1A (H4736) 
and pch2A rdnAA dmc1A (H4737) cells. Arrowheads indicate broken DNA 
fragments. c, Strategy used to generate chromosomal translocations between 
chromosomes XII and II. d, e, Profiles of ssDNA in strains containing the XILII 
translocation. In d, the depicted region is next to the rDNA in pch2A dmc1A 
(H2629) cells (dark blue) and next to the left arm of chromosome II in 
peh2A t(ILX1I) dmc1A (H4798) cells (light blue). In e, the depicted region is 
located on chromosome II in pch2A dmcl1A cells (orange) and next to rDNA in 
peh2A t(XILI) dmc1A cells (dark red). f, Southern blot of the left rDNA flank 
(HindIII digest; ARS1216 probe) and of YCRO47C in dmc1A (H118), 

pch2A dmciA (H2629), sir2A dmc1A (H2953) and pch2A sir2A dmc1A 
(H3038) cells. Positions of open reading frames are shown schematically 
alongside the Southern blot. g, Southern blot of the right rDNA flank and 
YCRO47C in pch2A sir2A dmc1A (H3262), pch2A sir2A dmc1A leu2::SIR2 
(H3261) and pceh2A sir2A dmc1A leu2::sir2-345 (H3282) cells. Arrowheads 
indicate broken DNA fragments. 


Sir2-dependent heterochromatin suppresses meiotic DSBs within the 
rDNA array (refs 2, 3 and Fig. 1a), it has a profound DSB-promoting 
effect on the rDNA borders that is counteracted by Pch2 and Orcl. It is 
also notable that Sir2 itself localizes Pch2 to the nucleolus’, reflecting 
an elegant coupling mechanism that maintains meiotic stability across 
the entire rDNA. The double dependence of Pch2 on Sir2 and Orcl 
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may promote Pch2 enrichment at the nucleolus, analogous to the 
bimodal recruitment mechanisms that restrict localization of Aurora 
B and shugoshin to centromeres”. 

Although repeat-associated chromatin marks differ substantially 
between organisms and even among individual loci”’, the assembly of 
heterochromatin on repetitive DNA arrays is a common strategy to 
protect the genome against destabilization caused by errors in meiotic 
recombination’. Our results establish borders between heterochromatin 
and euchromatin as potential high-risk regions for meiotic DSB forma- 
tion and NAHR, and reveal the existence of a secondary border-specific 
system that shields against these events. Buffer zones like those estab- 
lished by Pch2 and Orcl may need to be broad, because even DSBs 
adjacent to repetitive DNA can trigger NAHR™. Given the prominent 
presence of repetitive DNA arrays in genomes ranging from yeast to 
man’, we propose that mechanisms that limit DSB activity around 
repetitive DNA might be a widespread phenomenon. 


METHODS SUMMARY 


Yeast strains were of the SK1 background and are listed in Supplementary Table 3. 
Analysis of single-stranded DNA profiles and ChIP-chip analysis were performed 
as previously described*”*. These and other standard techniques used are detailed 
in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Yeast strains and two-hybrid analysis. All yeast strains used in this study were 
constructed in the SK1 background and are listed in Supplementary Table 3. 
Epitope tags and gene disruptions were introduced by standard PCR-based trans- 
formation. The orcl-Abah mutant was generated using a plasmid encoding a 
truncated version of Orcl (amino acids 235-914; pSPB1.48, gift from S. P. 
Bell**). To create URA3 insertions in the rDNA, cells were transformed with a 
pRS306-NTS1/2 plasmid linearized with Sphl (this plasmid contains a 2341-bp 
fragment harbouring the intergenic rDNA sequences, NTS1 and NTS2, ligated into 
the BamHI and EcoRI sites of pRS306). Insertion sites in the rDNA were mapped 
by CHEF gel analysis using a unique Xhol site in the inserted sequence (XhoI does 
not cut in the rDNA), and suitable clones were selected for further analysis. SK1 
strains lacking the rDNA array were generated as described in ref. 25. Briefly, cells 
were transformed with a very high-copy rDNA plasmid (pRDN-hyg:: URA3::leu2-8) 
carrying a recessive point mutation that confers resistance to hygromycin”’. After 
selection on hygromycin, a clone was selected that had lost all but three repeats of 
the rDNA array through spontaneous deletion, as determined by CHEF gel analysis. 
The remaining rDNA copies were subsequently deleted by conventional gene 
disruption using a HIS3 deletion cassette. Complete deletion of the rDNA array 
was confirmed by Southern blotting. Chromosomal translocations between chro- 
mosomes XII and II were generated essentially as previously described”. Briefly, 
plasmids containing a promoter-less ADE2 gene adjacent to a loxP site (loxP- 
ADE2::natMX4) and a GPD promoter (pGPD) with an adjacent loxP site 
(pGPD-loxP::hphMX4; both plasmids were gifts from N. Hunter) were integrated 
at YLR162W-A and LYS2, respectively. After induction of Cre recombinase from a 
pGAL-Cre plasmid (N. Hunter), cells were selected that had undergone transloca- 
tion between LYS2 (chromosome II) and YLRI62W-A (chromosome XII). 
Translocation was confirmed by Southern blot analysis. To identify interactors 
of Pch2 by two-hybrid screen, full-length PCH2 was amplified from genomic DNA 
and the intron was removed by site-directed mutagenesis. The Pch2 coding 
sequence was then cloned into pGBDU-C1, and the resulting bait plasmid was 
used to screen libraries in all three reading frames”. 

Synchronous meiosis. Cells were grown for 24h in yeast peptone dextrose (YPD) 
at 23°C then diluted in BYTA medium (1% yeast extract, 2% tryptone, 1% 
potassium acetate, 50mM potassium phthalate) to a optical density at 600 nm 
(OD609) of 0.3 (OD¢09 = 0.5 for orcl-161, rdnAA and XII; translocation strains), 
and grown for 16h at 30°C (or for 18h at 23°C in the case of temperature- 
sensitive strains). After two washes in water, cells were diluted into SPO medium 
(0.3% potassium acetate) at OD¢o9 = 1.9 and incubated at 30°C unless otherwise 
stated. 

Isolation of ssDNA. For ssDNA analysis”’”°, about 10° cells were fixed in 70% 
ethanol at —20 °C at 0 hand 5h after induction of meiosis. After spheroplasting in 
sorbitol buffer (1 M sorbitol, 1% B-mercaptoethanol, 0.2 mgml' zymolyase, and 
0.1M EDTA, pH 7.4), cells were lysed in NDS buffer (0.6% SDS, 300 mM EDTA, 
10 mM Tris-HCl, pH 9.5). After treatment with proteinase K (0.25 mg ml ~ 1) and 
RNase A, the DNA was digested with EcoRI and ssDNA was then enriched by 
adsorption to BND-cellulose and eluted using 1.8% caffeine. This enriched ssDNA 
was subsequently used for microarray analysis. For this, 1.5 1g of the respective 0-h 
and 5-h ssDNA samples was labelled with Cy3-dUTP or Cy5-dUTP (GE 
Healthcare) by random priming without denaturation using 4 jug of random primer 
(Integrated DNA Technologies) and 10 units of Klenow (New England Biolabs). 
Western blotting and immunoprecipitation. For western blotting, 5 ml of meiotic 
cells were harvested at the indicated time points and resuspended in 5% trichloro- 
acetic acid. After incubation on ice for 10 min, samples were washed in acetone and 
dried overnight. Samples were lysed by bead beating in a FastPrep FP120 (Thermo 
Scientific) in TE lysis buffer (10 mM Tris (pH 7.5), 1mM EDTA, 2.75 mM dithio- 
threitol). SDS loading buffer (3X) was added and the pH of the sample was adjusted 
to neutral by addition of 1M Tris (pH 8.0). Samples were separated by standard 
polyacrylamide gel electrophoresis. The following antibodies were used: anti-ORC 
(1108, 1:1,000, gift from S. P. Bell), anti-Fpr3 (1:1,000, gift from J. Thorner), anti-HA 
(3F10, 1:1,000, Roche), anti-Pgk1 (1:1,000, Invitrogen) and anti-histone H3 (1:1,000, 
Abcam). For immunoprecipitations, 50 ml of meiotic cells were harvested 3 h after 
induction of meiosis. Cells were diluted in 2X lysis buffer (20 mM HEPES (pH 7.5), 
4mM MgCl, 0.6 M glutamic acid, 0.32 M sorbitol, 4% glycerol, 0.5% Triton X-100) 
containing protease inhibitors, and lysed by bead beating. Extracts were sonicated 
and cleared by centrifugation. After removal of one tenth of the extract for an input 
sample, extracts were immunoprecipitated with 2 pl anti-HA (3F10, Roche) for 
3HA-Pch2, 2 pl anti-ORC (1108) and 2 ll anti-Fpr3, in combination with 20 ll of 
a50% slurry of GammaBind-Sepharose beads (GE Healthcare) for 16 h at 4 °C. After 
five washes in 1X lysis buffer, 1 SDS loading buffer was added to the beads, and 
samples were analysed by western blotting with the indicated antibodies. 
Chromatin immunoprecipitation. Meiotic cells (25 ml) were harvested 3 h after 
induction of meiosis and fixed for 15 min in 1% formaldehyde. The formaldehyde 


was quenched by addition of 125mM glycine. Samples were processed as previ- 
ously described”*. Before immunoprecipitation, one tenth of the sample was 
removed as input sample. The antibodies used for immunoprecipitation were: 
2 ul anti-Myc (9E11, Abcam; for Recl14-13Myc, Mer2-5Myc and Mrell- 
13Myc), 2ul anti-Hopl (gift from N. Hollingsworth), 2] anti-histone-H3 
(AB1791, Abcam) and 2 wl anti-H3K4me3 (AB8580, Abcam), in combination 
with 20 pl of a 50% slurry of GammaBind-Sepharose beads (GE Healthcare). 
Half of the ChIP sample and one tenth of the input were labelled with Cy3- 
dUTP or Cy5-dUTP (GE Healthcare) as described in the ssDNA protocol, with 
the difference that the DNA was denatured for 5 min at 95 °C before the extension 
reaction. 

Microarray analysis. After removal of unincorporated dyes, Cy3- and Cy5- 
labelled samples were hybridized to custom 4 X 44K tiled genomic yeast micro- 
arrays (Agilent Technologies) for 16h at 65°C. Levels of Cy3 and Cy5 were 
calculated with the Agilent Feature Extractor CGH software. Background normal- 
ization, log, ratios for each experiment and scale normalizations between experi- 
ments were calculated with the sma package in R (v2.1.0, http://www.r-project. 
org)”’”°. Each data set is an average of two experiments. For comparison between 
isogenic wild-type and pch2A cells, data sets were scale-normalized. To analyse the 
distribution of H3K4me3, we normalized the enrichment to that of total histone 
H3 generated from the same extracts, by subtracting the log, ratios. To measure 
the average ssDNA or ChIP enrichment in different chromosomal regions, the 
following SGD coordinates were analysed, on the basis of the positions of available 
array features: 

50 kb right of the rDNA: XII, 490,531-540,530. 

50 kb left of the rDNA: XII, 401,371-451,370. 

Rest of chromosome XII: XII, 1-401,370 and 540,531-1,078,177. 

First 100 kb of chromosome III: II, 1-100,000. 

Rest of chromosome III: III, 100,001-316,620. 

Chromosome VIII: VII, 1-562,643. 

Chromosome spreads and immunofluorescence. Meiotic cells were spread as 
described previously*". Cells were spheroplasted at 37 °C in solution 1 (2% potassium 
acetate, 0.8% sorbitol, 10mM dithiothreitol, 130mg ml! zymolyase 100T 
(Seikagaku)). Solution 2 (100mM MES (pH6.4), 1mM EDTA, 0.5mM MgCh, 
1M sorbitol) was added to stop spheroplasting. Spheroplasted cells (15 pil) were 
fixed with 30 ul of fixative solution (4% paraformaldehyde, 3.4% sucrose) and lysed 
with 60 p11 1% lipsol. After addition of 60 1] fixative solution, cells were spread using a 
glass rod. After drying, the slides were blocked in blocking buffer (0.2% gelatine, 0.5% 
BSA in PBS) and stained with the following antibodies: anti-HA (3F10, 1:500 
dilution, Roche), and anti-Nop1 (1:500 dilution, Encor Biotechnology). 

CHEF gel electrophoresis and Southern blotting. Chromosome fragments for 
CHEF analysis were prepared by restriction digest in agarose plugs. Briefly, 20 ml 
of meiotic cells were killed by addition of sodium azide (0.1% final w/v), pelleted 
and stored on ice for the duration of the time course. Cell pellets were washed twice 
in CHEF-TE (10mM Tris-HCl (pH 7.5), 50mM EDTA) and resuspended in 
300 pl CHEF-TE. Tubes were individually treated as follows: 4 ul zymolyase 
T100 (10 mg ml ') was added and the mix was incubated at 42 °C for 30s before 
addition of 500 ul low-melting-point agarose (1.5% SeaPlaque GTG, 125mM 
EDTA) at 42°C. Gel plugs (90 tl) were allowed to harden on ice in disposable 
plug molds (Bio-Rad) and incubated overnight at 37 °C in 300 pil LET (10 mM Tris 
(pH 7.5), 500 mM EDTA) per plug. Plugs were deproteinized overnight at 50 °C in 
200 pl NDS-PK (LET, 1% N-lauroylsarcosine, 1 mg ml~ 7 proteinase K (Amresco)) 
per plug. Proteinase K was inactivated by incubating plugs for 1 h at 4 °C in CHEF- 
TE containing 1 mM PMSF, and plugs were washed three times in CHEF-TE, then 
digested with Xhol in digestion buffer containing 5 mM spermidine. To analyse 
the entire rDNA array, digested chromosomes were separated by CHEF gel elec- 
trophoresis in 1% agarose in 0.5X TBE, 6 V cm}, using 60-s pulses for 15h and 
90-s pulses for 9 h. For fine mapping of rDNA insertions and to analyse changes in 
repeat number, digested chromosome fragments were separated using a 5-20s 
ramp over 20h. For conventional electrophoresis, DNA fragments were separated 
on 0.6% agarose in 1X TBE and transferred onto Hybond-XL membranes (GE 
Healthcare) by alkaline transfer. Southern blotting was performed as previously 
described* and quantified with a Fujifilm BAS-2500 image reader V1.8 and Multi 
Gauge V2.2 software. 

Probes for Southern analysis. Probe templates for non-rDNA sequences were 
generated by nested PCR and gel purification. The following probes (SGD coor- 
dinates) were used: 

YLR164W: XII, 493,432-493,932. 

YCRO47C: III, 209,361-210,030. 

ARS1216: XU, 450,407-451,150. 

YLR152C: XII, 443,849-444,9 10. 

NTS1: Xhol and Xbal digest of pRS306-NTS1/2. This probe detects all NTS1 and 
NTS2 sequences (pan-rDNA probe). 
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rDNA insertion: BciVI digest of pRS306-NTS1/2. This probe specifically detects 
the plasmid backbone of the URA3 insertion cassette. 

TOM1/YDR457W: IV: 1,370,714-1,371,733 

Flow cytometry. At the indicated time points, 150 ll of meiotic cells were fixed for 
2h at 4°C after addition of 350 ul absolute ethanol. Cells were resuspended in 
500 pl of 50 mM sodium citrate containing 0.7 il RNase A (30 mg ml’, Sigma). 
Cells were incubated for 2 h at 50 °C. 10 pl proteinase K (Amresco) was added and 
cells were deproteinated for 2 h at 50°C. 500 ul of 50 mM sodium citrate contain- 
ing 0.2 ul Sytox Green (Amersham) was added to the cells. Cells were briefly 
sonicated and analysed using a FACScalibur (Becton Dickinson) flow cytometer. 
DNA profiles were generated using CellQuest software. 

Recombination mapping. To determine crossover recombination rates, cells 
were sporulated for 24h in 3 ml SPO and treated with zymolyase (1 mg ml”! in 
1 M sorbitol) to remove ascus walls. Tetrads were dissected by micromanipulation 
and marker segregation was determined by replica plating on appropriate selective 
media. For mapping within the rDNA, only tetratypes were used to calculate 
recombination rates, to avoid distortions originating from non-parental ditypes 
that were probably the result of previous mitotic recombination. 
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Engineered reversal of the B-oxidation cycle for the 
synthesis of fuels and chemicals 


Clementina Dellomonaco!, James M. Clomburg", Elliot N. Miller! 


Advanced (long-chain) fuels and chemicals are generated from short- 
chain metabolic intermediates through pathways that require 
carbon-chain elongation. The condensation reactions mediating this 
carbon-carbon bond formation can be catalysed by enzymes from the 
thiolase superfamily, including f-ketoacyl-acyl-carrier protein (ACP) 
synthases, polyketide synthases, 3-hydroxy-3-methylglutaryl-CoA 
synthases, and biosynthetic thiolases'. Pathways involving these 
enzymes have been exploited for fuel and chemical production, with 
fatty-acid biosynthesis (B-ketoacyl-ACP synthases) attracting the 
most attention in recent years” *. Degradative thiolases, which are 
part of the thiolase superfamily and naturally function in the p- 
oxidation of fatty acids”*, can also operate in the synthetic direction 
and thus enable carbon-chain elongation. Here we demonstrate that a 
functional reversal of the B-oxidation cycle can be used as a metabolic 
platform for the synthesis of alcohols and carboxylic acids with 
various chain lengths and functionalities. This pathway operates with 
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coenzyme A (CoA) thioester intermediates and directly uses acetyl- 
CoA for acyl-chain elongation (rather than first requiring ATP- 
dependent activation to malonyl-CoA), characteristics that enable 
product synthesis at maximum carbon and energy efficiency. The 
reversal of the B-oxidation cycle was engineered in Escherichia coli 
and used in combination with endogenous dehydrogenases and 
thioesterases to synthesize n-alcohols, fatty acids and 3-hydroxy-, 
3-keto- and trans-A?-carboxylic acids. The superior nature of the 
engineered pathway was demonstrated by producing higher-chain 
linear n-alcohols (C= 4) and extracellular long-chain fatty acids 
(C>10) at higher efficiency than previously reported**’°. The 
ubiquitous nature of B-oxidation, aldehyde/alcohol dehydrogenase 
and thioesterase enzymes has the potential to enable the efficient 
synthesis of these products in other industrial organisms. 
Engineering the B-oxidation cycle as a metabolic platform for the 
synthesis of advanced fuels and chemicals (Fig. 1) requires the reverse 
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Figure 1 | Functional reversal of the B-oxidation cycle. The following 
enzymes (gene names in parentheses) are involved in the functional reversal of 
the B-oxidation cycle engineered in this work (shown in blue): (1) thiolase 
(ygeF, fadA); (2) hydroxyacyl-CoA dehydrogenase (fadB); (3) enoyl-CoA 
hydratase (fadB); (4) enoyl-CoA reductase (ydiO). Each turn of the cycle 
generates an acyl-CoA that is two carbons longer than the initial acyl-CoA 
thioester (indicated as C,, + 2). Intermediates in the engineered pathway can be 
converted to a functionally diverse set of molecules using aldehyde-forming 
acyl-CoA reductases and alcohol dehydrogenases (5) and acyl-CoA 
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thioesterases (6). Products for which synthesis was demonstrated in this study 
are boxed. R, side chain attached to the acyl-CoA group of the starter molecule 
(for example, R = H for acetyl-CoA and R = CH; for propionyl-CoA); 2[H], 
NADH/NADPH/ferredoxin. Dotted lines indicate multiple steps. Dashed lines 
without arrowheads connect identical metabolites of different chain length. A 
comparison of n-alcohol production through the engineered -oxidation cycle 
and the recently proposed fatty-acid biosynthesis pathway” is shown in 
Supplementary Fig. 1. 
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operation of this pathway in the absence of its natural substrate (that is, 
fatty acids) and presence of a non-fatty-acid carbon source (for 
example, glucose). Constitutive expression of the fad and ato regulons, 
which encode the f-oxidation system in Escherichia coli’, in the 
absence of fatty acids was achieved by introducing previously reported 
fadR and atoC(c) mutations'’® (Supplementary Table 1). Several 
operons encoding f-oxidation cycle enzymes are also activated by 
the cyclic AMP (cAMP) receptor protein (CRP)-cAMP complex 
and therefore subjected to carbon catabolite repression in the presence 
of glucose"’. This regulatory mechanism was circumvented by repla- 
cing the native crp gene with a cAMP-independent mutant (crp*) 
previously reported to confer a catabolite-derepressed phenotype” 
(Supplementary Table 1). Finally, the arcA gene was deleted to prevent 
ArcA-mediated repression of most operons encoding the B-oxidation 
cycle. 

The genetic modifications outlined above were introduced into 
wild-type E. coli MG1655 (resulting strain named RBO1: MG1655 
fadR atoC(c) crp* AarcA) and in a strain in which other fermentative 
pathways were blocked (resulting strain named RBO2: MG1655 fadR 
atoC(c) crp* AarcA AadhE Apta AfrdA). However, none of the pro- 
ducts that can be derived from the B-oxidation cycle intermediates 
(Fig. 1) was synthesized by these strains (Supplementary Table 2). A 
limitation on the expression of B-oxidation enzymes was ruled out 
after confirming the presence of these activities in strain RBO2 
(Table 1a). 

The expression of appropriate termination enzymes is vital to achieve 
the conversion of the CoA-thioester intermediates in the reversal of the 
B-oxidation cycle to the desired product(s) (Fig. 1). We first targeted the 
synthesis of n-butanol, as this product can be generated in a one-turn 
reversal of the $-oxidation cycle. Unlike the significant activity observed 
for key B-oxidation enzymes, the level of n-butanol dehydrogenase in 
strain RBO2 was very low (Table 1a). To address this issue, two endo- 
genous aldehyde/alcohol dehydrogenases with high sequence and struc- 
tural similarity to clostridial butyraldehyde/butanol dehydrogenase 
were expressed in strain RBO2: that is, L-1,2-propanediol oxidoreductase 
(FucO) and an aldehyde/alcohol dehydrogenase (YqhD) (Supplemen- 
tary Table 3). These modifications enabled a functional one-turn 
reversal of the B-oxidation cycle, as evidenced by the synthesis of n- 
butanol in strains RBO2 [yqhD+] and RBO2 [fucO+] (Fig. 2a). Despite 
the potential for YqhD to catalyse the conversion of butyraldehyde to 
n-butanol", overexpression of FucO led to higher concentration of 
n-butanol and lower concentration of the major fermentation by- 
product, ethanol (Fig. 2a). 

The initiation of the reversal of the B-oxidation cycle (that is, con- 
version of acetyl-CoA to acetoacetyl-CoA) is another key step in the 
engineered pathway. Therefore, we evaluated the effect of overexpres- 
sing atoB-encoded acetyl-CoA acetyltransferase, which has high spe- 
cificity for short-chain acyl-CoA molecules'*’® and a predicted 
acyltransferase (YqeF) with high sequence similarity to AtoB. These 
modifications also supported a functional reversal of the B-oxidation 
cycle, as evidenced by the synthesis of n-butanol in strains RBO2 
[atoB+] and RBO2 [ygeF+] (Fig. 2a). The simultaneous overexpres- 
sion of YqeF and FucO in strain RBO2 yielded a high n-butanol titre 
(1.9g1-'), n-butanol yield (0.19 g n-butanol per g total glucose con- 
sumed) and n-butanol-to-ethanol ratio (>5:1) (Fig. 2a). No n-butanol 
was synthesized upon overexpression of FucO and YqeF in wild-type 
MG1655 (Supplementary Table 2), demonstrating the requirement of 
an active reversal of the B-oxidation cycle. 

Further reduction in the synthesis of by-product ethanol was realized 
through deletion of eutE and yqhD: EutE is an aldehyde dehydrogenase 
with high sequence similarity to AdhE and YqhD overexpression 
favoured the synthesis of ethanol over n-butanol (Fig. 2a). The resulting 
strain (RBO2 AyghD AeutE [yqeF+fucO+]) synthesized 2.2 ¢1 ' of 
n-butanol in 24h at a yield of 0.28g n-butanol per g total glucose 
consumed (Fig. 2a). When grown in a bioreactor using a higher con- 
centration of glucose, this strain produced n-butanol at high titre 
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Figure 2 | Engineered one-turn reversal of the B-oxidation cycle for the 
synthesis of n-butanol and short-chain carboxylic acids. a, Effect of gene 
overexpression and knockout (indicated below x axis) on the synthesis of 
n-butanol and ethanol in strain RBO2 (fadR atoC(c) crp* AarcA AadhE Apta 
AfrdA). Experiments were performed at 30°C for 24h in shake flasks using 
glucose (2% w/v) minimal medium. The n-butanol yield was calculated as g 
n-butanol per g total glucose consumed. b, Kinetics of n-butanol production by 
strain RBO2 AyghD AeutE [yqeF+ fucO+ ]. Cells were cultivated in fermentors 
containing minimal medium supplemented with 5% (w/v) glucose. The 
dissolved oxygen was controlled at 5% of saturation, temperature at 30 °C, and 
pH at 7. c, Synthesis of §-ketobutyric acid (left), -hydroxybutyric acid (centre) 
and trans-2-butenoic acid (right) upon overexpression of thioesterases I (TesA) 
and II (TesB) in strains RBO2, RBO2 AfadB and RBO2 AydiO. Experiments were 
run at 37 °C for 48h in shake flasks using glucose (1% w/v) minimal medium. 
Error bars represent the standard deviation of three replicates. 


(~14g1-'), yield (0.33g n-butanol per g total glucose consumed) 
and rate (~2 g n-butanol per g cell dry weight per h) (Fig. 2b). This 
performance, which was achieved in the absence of rich nutrients and 
without importing foreign genes, is better than reported for other 
organisms engineered for n-butanol production (Supplementary 
Table 4) and also surpasses the n-butanol yield and specific produc- 
tivity reported for native producers'’. The engineered reversal of the 
B-oxidation cycle operated at a maximum flux of 73.4mmol acetyl- 
CoA per g cell dry weight per h (12-18h in Fig. 2b), which exceeds 
the flux reported in the literature for native or engineered fermentative 
pathways’*”. 

A characterization of the engineered B-oxidation pathway was 
conducted to confirm the identity of the enzymes catalysing each 
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individual step and their role in n-butanol synthesis (Table 1). Enzyme 
activity measurements showed a high level of expression of key 
pathway enzymes in strain RBO2 [ygeF+fucO+ ] but negligible activity 
in wild-type MG1655 (Table la). Gene knockout and gene comple- 
mentation experiments along with quantification of fermentation pro- 
ducts (Table 1b) demonstrated that the primary enzymes involved in 
the synthesis of n-butanol through the engineered reversal of the 
B-oxidation pathway are (activities shown in parentheses along with 
reaction numbers, as per Fig. 1): YqeF (predicted acyltransferase: reac- 
tion (1)); FadB (3-hydroxyacyl-CoA dehydrogenase and enoyl-CoA 
hydratase: reactions (2) and (3), respectively); YdiO (predicted acyl- 
CoA dehydrogenase/enoyl-CoA reductase: reaction (4)), and MhpF 
and FucO (aldehyde-forming acyl-CoA reductase and n-butanol 
dehydrogenase, respectively: reaction (5)). 

YdiO is proposed to catalyse the reduction of enoyl-CoA to acyl- 
CoA (reaction (4)). The reverse of this reaction is catalysed by FadE and 
is the only irreversible step in the catabolic operation of the B-oxidation 
cycle’. In agreement with our proposal, deletion of ydiO in strain RBO2 
[ygeF+fucO+] completely abolished n-butanol synthesis (Table 1b). 
Although ydiO was previously proposed to encode an acyl-CoA 
dehydrogenase that would replace FadE during the anaerobic cata- 
bolism of fatty acids*®, a sequence comparison between YdiO and 
E. coli proteins does not reveal a significant similarity to FadE (Sup- 
plementary Table 5). In contrast, YdiO shares high homology with 
crotonobetainyl-CoA reductase (CaiA; Supplementary Table 5). CaiA 
catalyses the reduction of crotonobetainyl-CoA to y-butyrobetainyl- 
CoA”, a reaction similar to that catalysed by YdiO in the reversal of 
the B-oxidation cycle. Moreover, the operon fixABCX is required for 
the transfer of electrons to CaiA”*”* and encodes flavoproteins and 
ferredoxin with high sequence similarity to YdiQRST (Supplementary 
Table 5). This analysis suggests that ferredoxin and flavoproteins 
encoded by ydiQRST” are involved in the transfer of electrons to 
YdiO during the reduction of enoyl-CoA to acyl-CoA. Standard 
Gibbs energy calculations reveal that the engineered reversal of the 
B-oxidation cycle is thermodynamically feasible if ferredoxin is the 
source of reducing power for the conversion of enoyl-CoA to acyl-CoA 
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(Supplementary Table 6). We then propose that the reduction of enoyl- 
CoA to acyl-CoA is mediated by YdiO-YdiQRST. 

The engineered reversal of the B-oxidation cycle generates a diverse 
set of CoA thioester intermediates that can be converted to the corres- 
ponding alcohols and carboxylic acids (Fig. 1). To illustrate product 
synthesis from intermediates other than acyl-CoA, we used thioesterase 
I (TesA)™ and thioesterase II (TesB)” as termination enzymes. Small 
amounts of 3-hydroxybutyric, 3-ketobutyric and trans-2-butenoic 
acids were produced when these thioesterases were overexpressed in 
strain RBO2 (Fig. 2c). The level of these products was significantly 
increased by simultaneous overexpression of thioesterase and yqeF- 
encoded short-chain acyltransferase (Fig. 2c). Further increases in titre 
were realized upon deletion of fadB (~500 mg! * 3-ketobutyric acid) 
and ydiO (~150 mg1 _' and 200 mg!" of 3-hydroxybutyric and trans- 
2-butenoic acids, respectively) (Fig. 2c). 

The operation of multiple cycles of the engineered reversal of the 
B-oxidation pathway, and hence the synthesis of CoA-thioester inter- 
mediates (and products) of longer chain length (C > 4), can be facili- 
tated by the overexpression of FadA, a 3-ketoacyl-CoA thiolase that is 
part of the B-oxidation complex (FadBA) and which possesses broad 
chain length specificity**. Overexpression of FadBA in conjunction 
with thioesterases (TesA, TesB, FadM?’ or YciA7’) in strain RBO3 
(RB0O2 AyqhD AfucO AfadD) resulted in the accumulation of long- 
chain fatty acids in the extracellular medium (Fig. 3a). The fadD 
knockout in strain RB03 prevents re-utilization of the synthesized fatty 
acids. The choice of thioesterase allowed control over both length and 
functionality of the fatty-acid side chain. For example, C16 and C18 
saturated fatty acids were the only products when FadM was over- 
expressed, whereas YciA and TesA overexpression supported the syn- 
thesis of 3-hydroxy (C14:30H) and unsaturated (C18:1) fatty acids, 
respectively (Fig. 3a). When grown in a bioreactor using a higher initial 
concentration of glucose, strain RBO3 [fadBA-fadM-+ ] produced long- 
chain extracellular fatty acids at high titre (~7 gl‘) and yield (0.28 g 
fatty acids per g total glucose consumed) using a mineral salts medium 
without rich nutrients (Fig. 3b). These results are better than reported 
previously using an engineered fatty-acid biosynthesis pathway~ 


Table 1 | Characterization of the engineered reversal of the B-oxidation cycle during n-butanol synthesis 


Strain Enzyme activity (umol per mg protein per min) + s.d. 
THL HBD CRT BDH 

(a) Activities of B-oxidation and butanol dehydrogenase enzymes in wild-type and engineered strains 
MG1655 ND 0.002 + 0.000 ND 0.014+0.001 
RBO2 (fadR atoC(c) crp* AarcA Apta AadhE AfrdA) 0.310 + 0.079 0.304 + 0.032 0.339 + 0.049 0.004 + 0.002 
RBO2 [ygeF+ fucO+] 0.498 + 0.036 0.292 + 0.013 0.334 + 0.017 0.298 + 0.020 
Strain Butanol produced Glucose used Cell growth 

Yield (gg~+) Concentration (gI~1) (gl) (gi) 
(b) Butanol synthesis, glucose utilization and cell growth in strain RBO2 and its derivatives* 
RBO2 [ygeF+ fucO+] 0.182 1.85 10.19 0.72 
Reaction (1): YqeF (predicted acyltransferase) 
RBO2 AygeF [fucO+] 0.019 0.10 5.19 0.32 
RBO2 [fucO+] 0.063 0.23 3.66 0.41 
RBO2 AygeF [fucO+ yqeF+] 0.159 141 7.00 0.43 
Reactions (2) and (3): FadB (3-hydroxyacyl-CoA dehydrogenase and enoyl-CoA hydratase) 
RBO2 AfadB [yqeF+ fucO+] ND ND 2.96 0,19 
RBO2 AfadB [yqeF+ fadB+] 0.157 0.92 5.86 0.52 
RBO2 AfadB [yqeF+] ND ND 1.84 0.11 
Reaction (4): YdiO (predicted acyl-CoA dehydrogenase/enoyl-CoA reductase) 
RBO2 AydiO [yqeF+ fucO+] 0.038 0.12 3.14 0.18 
RBO2 AydiO [yqeF+ ydiO+] 0.163 0.97 5.94 0.57 
RBO2 AydiO [yqeF+] 0.018 0.10 1.66 0.13 
Reaction (5): MhpF (aldehyde-forming acyl-CoA reductase) and FucO (n-butanol dehydrogenase) 
RBO2 AfucO [yqeF+] 0.015 0.04 2.75 0.38 
RBO2 AfucO [yqgeF+ fucO+] 0.151 0.85 5.64 0.51 
RBO2 [ygeF+] 0.088 0.35 4.00 0.68 
RBO2 AmhpF [yqeF+ fucO+] ND ND 0.66 0.07 


BDH, butanol dehydrogenase; CRT, crotonase; HBD, hydroxy-acyl-CoA dehydrogenase; THL, thiolase. 


* Experiments were run for 24h in shake flasks using glucose (2% w/v) minimal medium at 30°C. 


D, not detectable (minimum detectable activity was 0.001 mol per mg protein per min; minimum detection level for butanol was 5.84 mg!~?). 
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Figure 3 | Synthesis of higher-chain (C > 4) carboxylic acids and n-alcohols 
through the engineered reversal of the B-oxidation cycle. a, Accumulation of 
long-chain (C > 10) free fatty acids in the extracellular medium of strain RBO3 
[fadBA+] upon overexpression of different thioesterases (FadM, YciA, TesA, 
TesB). Product yield is shown above the bars (g free fatty acid per g total glucose 
consumed X 100). Experiments were run at 37 °C for 48 h in shake flasks using 
glucose (2% w/v) minimal medium. b, Kinetics of fatty-acid synthesis by strain 
RB03 [fadBA-fadM + ]. Cells were cultivated in fermentors using glucose (3% 
w/v) minimal medium. The dissolved oxygen was controlled at 2% of 
saturation, 37 °C, pH 7. FFAs, free fatty acids. c, Synthesis of n-alcohols in strain 
RBO3 [fadBA+] upon overexpression of alcohol dehydrogenases (YiaY, BetA 
and EutG). Product yield is shown above the bars (g n-alcohol per g total 
glucose consumed X 100). Experiments were run at 37 °C for 48h in shake 
flasks using glucose (2% w/v) minimal medium. d, Effect of alcohol 
dehydrogenase overexpression (YiaY, BetA and EutG) on the chain-length 
distribution of n-alcohols synthesized by strain RB03 [fadBA + ] in the presence 
of 0.5 gl ' propionate. Experiments were conducted as described in ¢. Error 
bars represent the standard deviation of three replicates. 


(Supplementary Table 4). No production of extracellular free fatty 
acids was observed upon overexpression of FadM in strain MG1655 
AadhE Apta AfrdA AfadD (Supplementary Table 2C), demonstrating 
the requirement of an active reversal of the f-oxidation cycle. 
Measurements of total free fatty acids (that is, extracellular plus intra- 
cellular) in strain RBO3 [fadBA-fadM+] and the corresponding con- 
trols showed that the engineered reversal of the B-oxidation cycle 
contributed to the synthesis of 90-95% of the total free fatty acids 
(Supplementary Table 2C). 

The synthesis of longer-chain (C > 4) n-alcohols was also demon- 
strated by overexpressing the appropriate termination enzymes 
(Fig. 3c). We identified native enzymes that could serve as potential 
surrogates for the aldehyde-forming acyl-CoA reductases and alcohol 
dehydrogenases present in organisms that synthesize higher-chain 
linear n-alcohols (Supplementary Table 3). The product titre 
(0.33 g1') and yield (8.3% w/w) achieved upon overexpression of 
YiaY were higher than previously reported*”” (Supplementary Table 
4). Synthesis of odd-chain n-alcohols was demonstrated by supple- 
menting the medium with propionate as the precursor of propionyl- 
CoA (R = CH; in Fig. 1). A clear shift in the distribution of n-alcohols 
was observed: odd-chain alcohols 1-pentanol, 1-heptanol and 1-nonanol 
appeared as fermentation products and the synthesis of even-chain 
alcohols significantly decreased (Fig. 3d). 

The functional reversal of the B-oxidation cycle engineered in this 
work represents a novel and efficient platform for the synthesis of 
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advanced fuels and chemicals. Its superior nature is illustrated in the 
following balanced equation for the synthesis of n-alcohols from glucose: 
nl4 C6H120¢6 > C,Hy +20 + n/2 CO2 + (n/2 — 1) H20 + n/2 ATP, 
with n being the chain length of the n-alcohol (Fig. 1). As can be seen in 
this balance, the engineered pathway has the potential to achieve the 
maximum theoretical yield of n-alcohols on glucose (66.7%, C-mole 
basis) and generates 1 ATP per each two-carbon unit incorporated into 
the n-alcohol molecule. This ATP yield is equivalent to that of efficient 
homo-fermentative pathways found in nature such as ethanol” and 
lactic acid’? fermentations. The high efficiency of the engineered 
reversal of the f-oxidation cycle is possible because it directly uses 
acetyl-CoA as the donor of two-carbon units during chain elongation 
(as opposed to first requiring ATP-dependent activation to malonyl- 
CoA) and it functions with acyl-CoA intermediates, which are the 
precursors of n-alcohols and other important products (Fig. 1). 

The synthesis of n-alcohols through alternative metabolic routes, 
such as the fatty-acid biosynthesis’ and keto-acid” pathways, is less 
efficient. The use of the fatty acid biosynthesis pathway results in the 
net consumption of 1 ATP per molecule of n-alcohol synthesized 
(Supplementary Fig. 1). This inefficiency is due to the consumption 
of ATP in the synthesis of malonyl-ACP, the donor of two-carbon 
units for chain elongation, and the use of acyl-ACP intermediates, 
which need to be converted to free acids and acylated (ATP-consuming 
step) before their reduction to alcohols can be achieved (Supplemen- 
tary Fig. 1). The keto-acid pathway” is also less efficient than the 
reversal of the B-oxidation cycle: for example, the maximum theor- 
etical yield of n-hexanol, the highest-chain linear n-alcohol reported 
with the keto-acid pathway, is only 50% C-mole (2 glucose > 
n-hexanol + ATP + 2[H] + 6 CO,). 


METHODS SUMMARY 

Strains and plasmids. Wild-type K12 Escherichia coli strain MG1655 was used as 
the host to implement metabolic engineering strategies. Genetic manipulations, 
including construction of plasmids, are described in the Methods. All resulting 
strains and plasmids are listed in Supplementary Table 7. 

Culture medium, cultivation conditions and analytical methods. The mineral 
salts medium described in Methods was used. Ampicillin (100 pg ml~'), kanamycin 
(50 ug ml 1; chloramphenicol (34 jig ml~ 1), anhydrotetracycline (100 ng ml and 
isopropyl B-p-1-thiogalactopyranoside (IPTG; 0.1 mM) were included when appro- 
priate. Several colonies were used to inoculate 30 ml of medium in 50 ml baffled flasks 
and incubated at 37°C in a rotary shaker (200r.p.m.) for 24h. An appropriate 
amount of pellet from this pre-culture was used to inoculate 30 ml of fresh medium 
in 50 ml baffled flasks to a target initial optical density at 550 nm of 0.05. These 
cultures were then incubated at 30 °C in a rotary shaker (200 r.p.m.) for 24h, unless 
otherwise stated. The same inoculation procedure was used for additional fermenta- 
tions conducted in a SixFors multi-fermentation system (Infors HT) with an air 
flowrate of 1.5N1I~'h' and control of dissolved oxygen, pH, and temperature at 
specified values. Metabolite identification and quantification by nuclear magnetic 
resonance (NMR), gas chromatography-mass spectroscopy (GC-MS), gas chro- 
matography-flame ionization detection (GC-FID), and high-performance liquid 
chromatography (HPLC) are described in Methods. Product yields were calculated 
as grams of product synthesized per total grams of glucose consumed. All values 
reported as different are significant with a P-value lower than 0.05. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Strains and plasmids. Gene knockouts were introduced in MG1655 and its 
derivatives by Pl phage transduction as previously described*’. The native 
cAMP receptor protein (CRP) was replaced with a cAMP-independent CRP 
(CRP*)** by introducing the crp* gene into crp deletion mutants via P1 phage 
transduction using a lysate from strain ET25 (ref. 13), followed by selection on (1% 
w/v) glycerol minimal media plates with tetracycline. Construction of plasmids 
(medium-copy pTrcHis2A, Invitrogen, and low-copy pZSblank (ref. 31)) is 
described in the ‘Genetic methods’ section. All resulting strains and plasmids 
are listed in Supplementary Table 7. 

Reagents. Chemicals were obtained from Fisher Scientific and Sigma-Aldrich. 
Culture medium. The minimal medium designed previously? with NayHPO, in 
place of K, HPO, and supplemented with 20 gl * glucose, 40 g1-* calcium carbonate, 
100 tM FeSO,, 5 mM calcium pantothenate, 3.96 mM Na,HPO,, 5 mM (NH4)2SO4 
and 30mM NH,Cl was used. Fermentations conducted in the SixFors multi-fer- 
mentation system also included 1 mM betaine. Ampicillin (100 pg ml’), kanamycin 
(50 pg ml~ ty: chloramphenicol (34 jig ml~ Ay. anhydrotetracycline (100 ng ml 1) and 
isopropyl B-b-1-thiogalactopyranoside (IPTG; 0.1mM) were included when 
appropriate. Several colonies were used to inoculate 30 ml of medium in 50 ml 
baffled flasks and incubated at 37 °C in a rotary shaker (200 r.p.m.) for 24h. An 
appropriate amount of pellet from this pre-culture was used to inoculate 30 ml of 
fresh medium in 50 ml baffled flasks to a target initial optical density at 550 nm of 
0.05. These cultures were then incubated at 30 °C ina rotary shaker (200 r.p.m.) for 
24h, unless otherwise stated. 

Genetic methods. Standard recombinant DNA procedures were used for gene 
cloning, plasmid isolation and electroporation**. Manufacturer protocols and 
standard methods were followed for DNA purification (Qiagen), restriction endo- 
nuclease digestion (New England Biolabs) and DNA amplification (Stratagene 
and Invitrogen). For plasmid construction, genes were amplified from MG1655 
genomic DNA using primers designed to create 15 bp of homology on each end of 
the gene insert for subsequent recombination into the desired plasmid. Plasmids 
were linearized using restriction endonuclease digestion, then recombined with 
the appropriate gene(s) using an In-Fusion Dry-Down PCR Cloning Kit 
(Clontech) and subsequently used to transform chemically competent Fusion 
Blue cells (Clontech). Transformants that grew on LB plates containing the appro- 
priate antibiotic were struck for isolation, and then subjected to preliminary 
screening by PCR. Colonies passing preliminary inspection were then individually 
grown for plasmid purification. Purified plasmids were confirmed to have the 
appropriate insert both by PCR as well as restriction endonuclease digest verifica- 
tion. Plasmids in each case include the plasmid-based promoter, a ribosomal 
binding site for each gene, MG1655 gene(s), and a plasmid-based terminator. 
Resulting plasmids are listed in Supplementary Table 7. 

Sequencing of regulators. Chromosomal DNA from strain RBO2 served as the 
template for PCR using Phusion High-Fidelity DNA polymerase (Finnzymes). 
PCR products, containing the gene(s) of interest as well as approximately 
500 bp upstream of each gene, were purified using a QIAquick PCR purification 
kit (Qiagen) and sent to Lone Star Labs for sequencing. Sequencing results were 
analysed using Sequence Scanner v1.0 (Applied Biosystems) and Serial Cloner v2.1 
(Serial Basics, http://serialbasics.free.fr/Home/Home.html). 

Metabolite identification. The identity of metabolic products was determined 
through one-dimensional proton nuclear magnetic resonance (NMR) spectroscopy. 
60 pl of D2O and 1 pl of 600 mM NMR internal standard TSP [3-(trimethylsilyl) 
propionic acid-D4, sodium salt] were added to 540 ul of the sample (culture super- 
natant). The resulting solution was then transferred to a 5-mm NMR tube, and 
one-dimensional proton NMR spectroscopy was performed at 25 °C in a Varian 
500-MHz Inova spectrometer equipped with a Penta probe (Varian) using the 
following parameters: 8,000-Hz sweep width, 2.8-s acquisition time, 256 acquisi- 
tions, 6.3 is pulse width, 1.2-s pulse repetition delay, and pre-saturation for 2s. 
The resulting spectrum was analysed using FELIX 2001 software (Accelrys 
Software). Peaks were identified by their chemical shifts and J-coupling values, 
which were obtained in separate experiments in which samples were spiked with 
metabolite standards (2 mM final concentration). 

Identification of n-alcohols was conducted through GC-MS following a modi- 
fication of the method reported previously”. The analysis was performed on an 
Agilent 6890 GC/5973 MS (Agilent Technologies) instrument with a HP-5ms 
capillary column (30m X 0.25mm X 0.25 um). I ml of supernatant of culture 
broth was extracted with 500 pl of GC standard grade n-hexane (Fluka) and 
1-heptanol (Sigma-Aldrich) was used as internal standard. 0.5 ll of the extracted 
sample was injected using a 20:1 split at 250 °C. The oven temperature was initially 
held at 75 °C for 2 min and then raised with a gradient of 5 °C per min to 280 °C 
and held for 2 min. Helium (Matheson Tri-Gas) was used as the carrier gas with a 
14-Ibin * inlet pressure. The injector and detector were maintained at 255°C. 


Identification of fatty acids was performed on a Shimadzu Auto-System GC 

2010 (Shimadzu) equipped with a DB-5MS capillary column (30m X 0.25 mm 
X 0.25 jum) and coupled with a QP-2010 mass detector. The following method was 
used: an initial temperature of 50 °C was held for 2 min and then ramped to 220 °C 
at 4 °C per min and held for 10 min (ref. 2). Extraction and derivatization proce- 
dures are described in the “Metabolite quantification’ section. 
Metabolite quantification. The quantification of glucose, organic acids, ethanol 
and butanol was conducted by high-performance liquid chromatography (HPLC). 
Samples (culture supernatant) were analysed with ion-exclusion HPLC using a 
Shimadzu Prominence SIL 20 system (Shimadzu Scientific Instruments) equipped 
with an HPX-87H organic acid column (Bio-Rad) with operating conditions to 
optimize peak separation (0.3 ml min | flowrate, 30mM H,SO, mobile phase, 
column temperature 42 °C)**. 

Quantification of longer chain (C > 4) n-alcohols was conducted through gas 
chromatography (GC) in a Varian CP-3800 gas chromatograph (Varian 
Associates) equipped with a flame ionization detector (GC-FID). Sample extrac- 
tion procedure was as described above. The separation of alcohol compounds was 
carried out using a FactorFour VF-5ht column (15 m, 0.32 jum internal diameter, 
0.10 um film thickness; Varian Associates). The oven temperature was initially 
held at 40°C for 1 min and then raised with a gradient of 30°C min * to 130°C 
and held for 4min. The temperature was then raised with a gradient of 
15°Cmin | to 230°C and held for 4 min. Helium (1 ml min ', Matheson Tri- 
Gas) was used as the carrier gas. The injector and detector were maintained at 
250°C. A 0.5-tl sample was injected in splitless injection mode. 

For total free fatty acid quantification, 5 ml of culture was placed in a 15-ml 

Corning CentriStar centrifuge tube (Corning Inc.) in an ice bath and sonicated 
using a Branson Sonifier 450 (Emerson Industrial Automation) for 5 min at a duty 
cycle of 30% and a power setting of 1. The tubes were then centrifuged at 12,000g¢ 
for 5 min and the appropriate volume of supernatant recovered. For extracellular 
free fatty-acid analysis, the supernatant from non-sonicated culture samples was 
used instead. 2 ml of supernatant were acidified with 200 1] of acetic acid, supple- 
mented with 100 mg of octanoic acid as an internal standard, and extracted with 
2 ml of n-hexane-chlorophorm (4:1, v/v)*”. The organic layer was nitrogen evapo- 
rated to near dryness, re-dissolved in 1 ml of a mixture of cholophorm:methanol: 
hydrochloric acid (10:1:1, vol/vol/vol)'® and incubated in a sealed vial at 90 °C for 
Lh. Fatty-acid methyl esters were extracted with 1 ml of n-hexane-chlorophorm 
(4:1, v/v) after addition of 500 pl of 0.9% (wt/vol) NaCl in water. Gas-liquid 
chromatography was performed with 1 ll of the n-hexane:chlorophorm solution 
ona Varian FactorFour VF-5ht capillary column (Varian Associates, 15 m length, 
0.32 um diameter and 0.1 jm film thickness) according to the following method: 
50 °Cheld for 1 min, 30°C min’ ' to 160°C, 15°C min ' to 200°C, 200 °C held for 
1.5 min, 10°C min ! to 225°C, and 225°C held for 15 min (ref. 10). 
Enzyme assays. For measurement of enzymatic activities, cells from 24-h shake 
flask cultures were washed twice with 9g1~' sodium chloride under anaerobic 
conditions and stored at — 80 °C until use. Cell extracts for all assays were prepared 
as follows under anaerobic conditions: 40 units of OD550 nm were re-suspended in 
1 ml of 100 mM Tris-HCl buffer (pH 7.0) with 1mM DTT. After cellular disrup- 
tion using a Disruptor Genie (Scientific Industries), cellular debris was removed by 
centrifugation (13,000g, 4°C, 10 min) and the supernatant used as cell extract. 
Absorbance changes for all assays were monitored in a Biomate 5 spectropho- 
tometer (Thermo Scientific). The linearity of reactions (protein concentration and 
time) was established for all assays and the non-enzymatic rates were subtracted 
from the observed initial reaction rates. Enzymatic activities are reported as mol 
of substrate per min per mg of cell protein and represent averages for at least three 
cell preparations. Protein concentration was measured using the Bradford Assay 
Reagent (Thermo Scientific) with BSA as a standard. 

Acetyl-CoA acetyltransferase (THL) activity was determined using acetoacetyl- 
CoA and CoA as substrates, and the decrease in acetoacetyl-CoA concentration was 
measured at 303 nm (ref. 38). §-Hydroxybutyryl-CoA dehydrogenase activity was 
measured at 340 nm by monitoring the decrease in NADH concentration resulting 
from B-hydroxybutyryl-CoA formation from acetoacetyl-CoA (ref. 39). Crotonase 
activity was measured by monitoring the decrease in crotonyl-CoA concentration 
at 263 nm, which results from f-hydroxybutyryl-CoA formation from crotonyl- 
CoA (ref. 39). To measure butanol dehydrogenase activity, the decrease in 
NAD(P)H concentration resulting from butanol formation from butyraldehyde 
is monitored at 340 nm under anaerobic conditions at 30 °C (ref. 40). 


31. Yazdani, S.S.& Gonzalez, R. Engineering Escherichia coli for the efficient conversion 
of glycerol to ethanol and co-products. Metab. Eng. 10, 340-351 (2008). 

32. Cirino, P. C., Chin, J. W. & Ingram, L. O. Engineering Escherichia coli for xylitol 
production from glucose-xylose mixtures. Biotechnol. Bioeng. 95, 1167-1176 
(2006). 

33. Neidhart, F. C., Bloch, P. L. & Smith, D. F. Culture media for enterobacteria. 

J. Bacteriol. 119, 736-747 (1974). 


©2011 Macmillan Publishers Limited. All rights reserved 


34. 
35. 


36. 


37. 


LETTER 


Sambrook, J. & Russell, D. W. Molecular Cloning: a Laboratory Manual (Cold Spring 38. Wiesenborn, D. P., Rudolph, F. B. & Papoutsakis, E. T. Thiolase from Clostridium 


Harbor Laboratory, 2001). acetobutylicum ATCC824 and its role in the synthesis of acids and solvents. Appl. 
Atsumi, S. et al. Metabolic engineering of Escherichia coli for 1-butanol production. Environ. Microbiol. 54, 2717-2722 (1988). 

Metab. Eng. 10, 305-311 (2008). 39. Hartmanis, M. G. N. & Gatenbeck, S. Intermediary metabolism in Clostridium 
Dharmadi, Y. & Gonzalez, R.A better global resolution function and a novel iterative acetobutylicum—levels of enzymes involved in the formation of acetate and 
stochastic search method for optimization of high-performance liquid butyrate. Appl. Environ. Microbiol. 47, 1277-1283 (1984). 

chromatographic separation. J. Chromatogr. A 1070, 89-101 (2005). 40. Durre, P., Kuhn, A., Gottwald, M. & Gottschalk, G. Enzymatic investigations on 
Lalman, J.A. & Bagley, D. M. Extracting long-chain fatty acids from a fermentation butanol dehydrogenase and butyraldehyde dehydrogenase in extracts of 
medium. J. Am. Oil Chem. Soc. 81, 105-110 (2004). Clostridium acetobutylicum. Appl. Microbiol. Biotechnol. 26, 268-272 (1987). 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature10335 


Modulation of Rab GTPase function by a protein 
phosphocholine transferase 


Shaeri Mukherjee'*, Xiaoyun Liu'*, Kohei Arasaki', Justin McDonough’, Jorge E. Galan! & Craig R. Roy! 


The intracellular pathogen Legionella pneumophila modulates the 
activity of host GTPases to direct the transport and assembly of the 
membrane-bound compartment in which it resides’ °. In vitro studies 
have indicated that the Legionella protein DrrA post-translationally 
modifies the GTPase Rab1 by a process called AMPylation’. Here we 
used mass spectrometry to investigate post-translational modifica- 
tions to Rab1 that occur during infection of host cells by Legionella. 
Consistent with in vitro studies, DrrA-mediated AMPylation of a 
conserved tyrosine residue in the switch II region of Rabl was 
detected during infection. In addition, a modification to an adjacent 
serine residue in Rab1 was discovered, which was independent of 
DrrA. The Legionella effector protein AnkX was required for this 
modification. Biochemical studies determined that AnkX directly 
mediates the covalent attachment of a phosphocholine moiety to 
Rabl. This phosphocholine transferase activity used CDP-choline 
as a substrate and required a conserved histidine residue located in 
the FIC domain of the AnkX protein. During infection, AnkX 
modified both Rab1 and Rab35, which explains how this protein 
modulates membrane transport through both the endocytic and 
exocytic pathways of the host cell. Thus, phosphocholination of 
Rab GTPases represents a mechanism by which bacterial FIC- 
domain-containing proteins can alter host-cell functions. 

Legionella pneumophila is an intracellular pathogen that translo- 
cates proteins called effectors into the host-cell cytosol using a type 
IV secretion system called Dot/Icm®. The Legionella protein DrrA (also 
known as SidM) is an effector that targets the host GTPase Rab1 (refs 
1-3, 5). Initially identified as a Rabl-specific guanine nucleotide 
exchange factor (GEF), recent studies showed that the amino-terminal 
region of DrrA has structural similarity to glutamine synthetase ade- 
nylyl transferase (GS-ATase) and shares the catalytically important 
sequence motif G-X,,-D-X-D, which enables DrrA to AMPylate the 
Tyr 77 residue in the class II switch region of Rab1B’. To determine if 
the in vitro activity described for DrrA is biologically relevant we 
examined whether the endogenous DrrA protein mediates Rab1l 
AMPylation when delivered into host cells during Legionella infection. 

Cells were infected with a strain of Legionella that has a functional 
Dot/Icm system that delivers effectors into host cells (wild type) or an 
isogenic AdotA mutant that has a non-functional Dot/Icm system, and 
Rabl protein was analysed by liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) (Fig. 1a). Two different modifications in 
the switch II region in Rab1 were detected after infection with wild- 
type Legionella. A fragment that corresponded to an AMPylated 
TITSSYYR peptide (mass to charge ratio, m/z = 660.5) was detected. 
Unexpectedly, a form of this peptide with an unknown moiety of 
183 Da (m/z = 578.5) was also detected (Fig. 1a). The AdotA mutant 
revealed that both modifications required the delivery of effector pro- 
teins into host cells during infection. Thus, Rabl is modified during 
Legionella infection by AMPylation and by a second unknown post- 
translational mechanism. 

Cells were infected with mutant strains of Legionella deficient in 
effectors that could be involved in AMPylation of Rab1. In addition to 


the AdrrA mutant, a AankX mutant of Legionella was examined. The 
AnkX protein contains a FIC domain, which for other bacterial effectors 
has been shown to have an enzymatic activity that promotes the 
AMPylation of small GTPases”. When microinjected into mammalian 
cells the AnkX protein disrupts membrane transport in the secretory 
pathway and interferes with the sorting of transferrin from early endo- 
somes, consistent with AnkX being an effector that disrupts the activities 
of host membrane transport proteins, potentially by Rab AMPylation™. 

Rab1 AMPylation was not detected in the samples isolated from cells 
infected with the AdrrA mutant, indicating that DrrA is the primary 
effector mediating Rabl1 AMPylation in vivo (Fig. 1a). The unknown 
modification (m/z = 578.5) was detected after infection with the AdrrA 
mutant, but was not detected after infection with the AankX mutant. 
Thus, the unknown modification to Rab1 that occurs during infection 
requires AnkX. Defects in Rabl modifications exhibited by these 
Legionella mutants were complemented upon the introduction of plas- 
mids that restored DrrA and AnkX production (Supplementary Fig. 1a). 
MS/MS analysis revealed that the unknown 183 Da moiety was attached 
to Ser 79 of Rabl1A, adjacent to the Tyr 80 residue AMPylated by DrrA 
(Fig. 1b). These residues correspond to Ser 76 and Tyr 77 in Rab1B. 

Purified DrrA radiolabelled GST-Rab1 in vitro when **P-a-labelled 
ATP was used as a substrate, but no labelling was detected using *’P-y- 
labelled ATP, validating that DrrA mediates the attachment of AMP to 
Rab] (Fig. 1c). The structurally distinct N-terminal region of DrrA was 
sufficient for AMPylation*'*'’, and no AMPylation activity was 
detected for DrrA(340-533) or the DrrA(D110A,D112A) variant hav- 
ing the G-X,,;-D-X-D adenylyl transferase domain inactivated. The 
effector AnkX was unable to efficiently AMPylate Rabl, the GTP- 
locked Rab1(Q70L) variant or the GDP-locked Rab1(S25N) variant, 
indicating that purified AnkX does not have robust Rabl AMPylation 
activity (Fig. 1c). 

The nature of the unknown modification to Rab1 requiring AnkX was 
investigated further. Cells were transfected with a plasmid encoding either 
AnkX or the variant AnkX(H229A), which has the essential histidine 
residue in the FIC domain changed to alanine. Roughly 70% of the 
Rab] isolated from cells producing AnkX had the 183 Da moiety attached, 
whereas Rab] isolated from cells producing the AnkX(H229A) protein 
was unmodified (Fig. 2a). Thus, AnkX is both necessary and sufficient 
to promote a novel post-translational modification to Rab1 by a pro- 
cess that requires a functional FIC domain. 

For molecules <200 Da, the elemental composition can often be 
determined from a highly accurate mass measurement'*. High- 
resolution MS measurements obtained for the modified Rab1 peptide 
isolated from cells producing AnkX revealed that the moiety attached 
to the Ser79 residue had an accurate mass of 183.0661 Da (Sup- 
plementary Fig. 1b). This moiety did not match any known post- 
translational modifications, but when a metabolite database (http:// 
metlin.scripps.edu) was searched a near perfect match was made to 
the molecule phosphocholine, which has an exact mass of 183.0660 Da. 
The protonated moiety attached to the Rab1 peptide was selected and 
further dissociated by multi-stage MS analysis (MS/MS/MS). The 


Section of Microbial Pathogenesis, Yale University School of Medicine, Boyer Center for Molecular Medicine, Yale University, 295 Congress Avenue, New Haven, Connecticut, CT-06536, USA. 


*These authors contributed equally to this work. 


00 MONTH 2011]! VOL 000 | NATURE | 1 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


a m/z = 496 m/z = 660.5 m/z = 578.5 b 100 TITSSY(amp)YR (m/z = 660.5) [MH-Adenine]* 
No AMPylated Unknown os 1,184.5 
modification peptide modification 9 80 
o 
3.8 x 107 1.5 x 108 1.7 x 108 2 60 [MH-Adenine-H,O]* 
§ 
LN ne ne 
oO 
2 40 
‘= y2 Bet 
s b2 3338.3 b4 y7 
2 29 4 215.2 316.3 403.3 202-9 1,218.8 
3.8 x 107 1.5 x 108 1.7.x 108 { { 3 
0 
AdotA 200 300 400 500 600 700 800 900 1,000 1,100 1,200 
120 — ——— en 0 
7 100 9 tiTSs*VYR (m/z = 578.5) [MH,-184]* 
€ [MH,-H,O]2* 706 
5 3.8x107 1.5 x 108 1.7 x 108 @ 80 569.5 ‘ 
= & [MH,-184-H,O}* | [TITSpSYYR+H-H,OJ* 
§ AankX zg 60 954.6 1,052.5 
S cs 
z © 40 y42+ y6 b7 \ [TITSpSYYR+H]* 
& g 3777 Ve 168 041.6 1 |981-5\' 1,070.6 
3.8 x 107 1.5 x 108 1.7 x 108 @ op J 184.1 b3y2 52+ 753.4 818.6 y5 \ 
c b2 316.4 338.4] ¥ 840.5 
AdrrA 215.2 \ 420.8 
0 
200 300 400 500 600, 700 800 900 1,000 1,100 
3.8 x 107 1.5 x 108 1.7.x 108 4 
30 
AankX AdrrA c A oo . ee a 
an Ir ~ \) \) 
oo oe" ow oi od o = 
A \ \ x 9 ps iN \ NOT oN a0 soy 
oe go oe e oe ow 2d) 2 ad He 08 
x x x x 
oe gah gg gah gn on ow mA oP OaP™ gah ™ pat pak” pat 
82P-c-ATP 32p-y-ATP 32P_¢.-ATP 32P-¢-ATP 


Figure 1 | Legionella infection mediates two different post-translational 
modifications to Rabl. a, 3-Flag—Rab1 was isolated from HEK293 FcyRII 
cells after infection with the indicated Legionella strains. LC-MS/MS analysis 
produced the extracted ion chromatograms. The peak in each graph indicates 
the amount of Rab1 peptide TITSSYYR with no modifications (m/z = 496.0), 
peptide containing an AMP moiety (m/z = 660.5), and peptide with an 
unknown modification (m/z = 578.5). b, MS/MS spectra obtained for the 
AMPylated Rab1 peptide (TITSSY(amp) YR, top) and the Rab1 peptide with 
the unknown modification (TITSS*YYR, bottom) showing m/z values of their 


fragments generated were of the sizes predicted for phosphocholine 
(Fig. 2b) and matched the MS/MS spectrum obtained following dis- 
sociation of a phosphocholine standard (Supplementary Fig. 2), sug- 
gesting that Rab1 is phosphocholinated by AnkX. 

If AnkX were functioning directly as a phosphocholine transferase, 
the host molecule most likely to be used as a substrate in this reaction 


fragments upon collision-induced dissociation. Fragments of the AMPylated 
peptide (top spectrum) have a mass shift of 329 starting at y3 (y3-y7), 
indicating a modification site at the first Tyr from the N terminus of the peptide. 
Fragments of the peptide with the unknown modification have a mass increase 
of 165 starting at y4 (y4—y6), indicating the second Ser from the N terminus was 
modified. c, Autoradiographs reveal AMPylated Rab1 from in vitro reactions 
containing the recombinant proteins indicated on the top of each panel and the 
radiolabelled nucleotides indicated below. 


would be CDP-choline, which is an intermediate used to synthesize 
phosphatidylcholine’’. Indeed, phosphocholination of Ser 79 on Rab1 
was detected for in vitro reactions containing CDP-choline and Ankx, 
but not in reactions containing DrrA (Fig. 2c) or the AnkX(H229A) 
protein (Supplementary Fig. 3). Increasing the amount of AnkX in the 
in vitro reaction resulted in higher levels of phosphocholinated Rab1 
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Figure 2 | The Legionella effector AnkX functions as a Rab phosphocholine 
transferase. a, LC-MS/MS analysis of 3x-Flag-Rab1 isolated from HEK293 
cells that were either untransfected or transfected with a plasmid encoding 
GFP-tagged AnkX, AnkX(H229A) or DrrA. Extracted ion chromatograms 
indicate the amount of Rab1 peptide TITSSYYR with no modification 

(m/z = 496) and peptide with the unknown modification (m/z = 578.5). b, MS/ 
MS/MS analysis on the Rab1 peptide TITSSYYR with the unknown 
modification. The m/z = 184 peak corresponding to the protonated moiety 
attached to the Rab1 peptide was selected and subjected to further dissociation. 
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Indicated are the fragments identified in the MS/MS/MS spectrum that 
matched the fragments predicted upon dissociation of the protonated 
phosphocholine molecule. c, The peak in each graph indicates the amount of 
Rab1 peptide TITSSYYR with no modifications (m/z = 496) and 
phosphocholinated peptide (m/z = 578.5) after in vitro incubation of Rab1 with 
either DrrA or AnkX in the presence of CDP-choline. d, Immunoblots from in 
vitro reactions that contained Rab1 and the indicated amounts of AnkX. Blots 
were probed to detect phosphocholinated Rab1 (anti-PC) and total Rab] (anti- 
Rab1) in each reaction. 
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Figure 3 | AnkX and DrrA have overlapping but non-identical Rab 
specificities. a, Extracted ion chromatograms of 3 X -Flag—Rab6 isolated from 
HEK293 cells producing either DrrA or AnkX. Shown are graphs for the 
unmodified Rab6 peptide SLIPSYIR (m/z = 475) and the AMPylated 

(m/z = 639.5) and phosphocholinated (m/z = 557.5) forms of this peptide. 

b, Extracted ion chromatograms of 3 X -Flag—Rab35 isolated from HEK293 cells 
producing either DrrA or AnkX. Shown are graphs for the unmodified Rab35 


being detected by anti-phosphocholine immunoblot analysis (Fig. 2d), 
validating that Rab1 is phosphocholinated by AnkX. 

Phosphocholinated proteins in the size range of Rab GTPases were 
detected in lysates from cells producing AnkX and were not observed 
in lysates from cells producing AnkX(H229A) (Supplementary Fig. 4a). 
Phosphatidylinositol 4-phosphate and phosphatidic acid levels were 
not affected in cells producing AnkX, suggesting that there is no indirect 
effect on phospholipid metabolism (Supplementary Fig. 4b). The intra- 
cellular pathogen Coxiella burnetii translocates a FIC-domain effector 
called CBU_2078 into host cells*’. Although proteins reacting with the 
anti-phosphocholine antibody were found in the size range of small 
GTPases from cell lysates producing CBU_2078, there was no evidence 
of Golgi fragmentation or endosome enlargement in these cells 
(Supplementary Fig. 4a, c). Thus, defects in host membrane transport 
in AnkX-producing cells probably results from phosphocholination of 
a specific subset of Rab GTPases, although it cannot be excluded that 
CDP-choline consumption might augment these effects. 

The repertoire of Rab proteins that could be modified by Legionella 
effectors in vivo was investigated. Modifications to the Rab5 protein 
were not detected in cells producing either AnkX or DrrA (Sup- 
plementary Fig. 5). DrrA mediated the AMPylation of Rab6 on 
Tyr 82, but phosphocholination of Rab6 mediated by AnkX was not 
detected (Fig. 3a and Supplementary Fig. 5). Rab35 is a Rab1 family 
member that regulates the sorting of cargo from early endosomes, 
and interfering with Rab35 function results in enlarged early endo- 
somes”. Importantly, specific perturbations in Rab35 function result 
in a cellular phenotype that closely mirrors the defects in endosome 
morphology observed in cells microinjected with purified AnkX”™*. 
Phosphocholinated Rab35 was detected in samples isolated from cells 
producing AnkX, and AMPylated Rab35 was detected from cells pro- 
ducing DrrA (Fig. 3b). During infection, phosphocholination of Rab35 
required AnkX and AMPylation of Rab35 required DrrA (Fig. 3c). 
Thus, AnkX has specificity for Rab1 family members. 

To test whether previously described cellular disruptions mediated 
by AnkX required the FIC-domain-dependent phosphocholine trans- 
ferase activity (Fig. 4a), cellular phenotypes mediated by AnkX and the 
AnkX(H229A) mutant were compared. Disruption of the Golgi appar- 
atus and a block in secretion of host alkaline phosphatase into the 
culture supernatant were observed in cells producing AnkX but not 
in cells producing AnkX(H229A) (Fig. 4b and Supplementary Fig. 7). 
Importantly, when the Ser 79 residue in Rab1 was changed to alanine, 
the variant protein was no longer phosphocholinated by AnkX (Sup- 
plementary Fig. 8a); however, Rab1(S79A) interfered with secretion of 
alkaline phosphatase when produced in cells (Supplementary Fig. 8b). 
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peptide TITSTYYR (m/z = 503.0), the AMPylated peptide (m/z = 667.5) and 
the phosphocholinated peptide (m/z = 585.5). ¢, 3X-Flag-Rab35 was isolated 
from HEK293 FcyRII cells after infection with the indicated Legionella strains. 
The peak in each graph indicates the amount of Rab35 peptide TITSTYYR with 
no modifications (m/z = 503), peptide containing an AMP moiety 

(m/z = 667.5) and phosphocholinated peptide (m/z = 585.5). WT, wild type. 


Thus, AnkX is targeting a residue in Rab1 that is critical for function. 
There was a significant increase in the number of cells containing 
enlarged early endosomes in cells producing AnkX compared to cells 
producing AnkX(H229A) (Fig. 4c and Supplementary Fig. 7), consist- 
ent with the function of Rab35 being perturbed by phosphocholination. 
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Figure 4 | AnkX-mediated phosphocholination modulates the function of 
Rab1 and Rab35. a, Schematic representation of the AnkX protein showing 
the location of the FIC domain and the four predicted ankyrin repeat homology 
domains (A1l-A4). The amino acid sequence in a conserved region of the FIC 
domain containing the essential His 229 residue is shown. b, Secretion of 
alkaline phosphatase into the culture supernatant was measured for HEK293 
cells producing either GFP, GFP-AnkX or GFP-AnkX(H229A) as indicated on 
the x-axis. Data are the mean + standard deviation (s.d.) calculated from three 
independent sample wells. c, The disruption of early endosomes was assessed in 
COS7 cells producing either GFP-AnkX, GFP—AnkX(H229A) or GFP alone 
after staining for EEA1 (mean = s.d., n = 200, *P < 0.005 compared to the GFP 
alone control). d, Binding of recombinant connecdenn to 3X -Flag- 
Rab35(S22N) isolated from cells producing either GFP-CBU_2078 or GFP- 
AnkX or GFP-AnkX(H229A) was assessed by co-precipitation. The 
Coomassie-stained SDS-PAGE gel indicates the amount of connecdenn and 
3X -Flag-Rab35(S22N) in each precipitate. The anti-phosphocholine 
immunoblot indicates that Rab35(S22N) isolated from cells producing GFP- 
AnkX was phosphocholinated. e, Binding of purified His-tagged DrrA(340- 
533) to 3X-Flag-Rab1A isolated from cells producing either GFP-CBU_2078 
or GFP-AnkX or GFP-AnkX(H229A) was assessed by co-precipitation. The 
immunoblots indicate the amounts of DrrA(340-533) and the levels of 
phosphocholinated 3X -Flag-Rab1A present in each precipitate. 
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Because GEF proteins are essential for Rab activation, the effect of 
phosphocholination on the binding of Rab-specific GEFs was analysed. 
The eukaryotic connecdenn proteins are the only known GEFs for 
Rab35 and are required for Rab35 function in vivo’. A pronounced 
defect in binding of connecdenn was observed for phosphocholinated 
Rab35 isolated from cells producing AnkX (Fig. 4d), which would 
explain why AnkX overproduction mimics the cellular phenotype 
observed when connecdenn has been silenced in mammalian cells”. 
By contrast, there was no defect in the binding of phosphocholinated 
Rab1 with the GEF domain of DrrA (Fig. 4e), similar to what has been 
observed for DrrA interactions with AMPylated Rab1 (ref. 7). Thus, 
post-translational modifications mediated by the effectors AnkX and 
DrrA modulate the function of Rab GTPases during infection by tailor- 
ing the repertoire of proteins that interact with the modified GTPase. 

The characterization of DrrA and AnkX provides an example of 
Legionella having structurally distinct proteins with different bio- 
chemical activities that modulate the function of host vesicle transport 
proteins similarly. This concept of functional redundancy has been 
postulated but not shown clearly. The differences observed in the in 
vivo specificities shown by these two effectors, however, demonstrate 
that they are also likely to have roles in modulating Rab protein func- 
tions that do not overlap, which could explain why positive selection 
has led to the emergence of two different pathways to modify Rab 
protein function through post-translational modification. 

The reaction mediated by AnkX has similarities to the AMPylation 
reaction demonstrated for other FIC domain proteins. Both reactions 
use a nucleotide-based substrate as the donor molecule that mediates 
the post-translational modification process (Supplementary Fig. 9). 
Interestingly, in the AMPylation reaction, hydrolysis of the phosphoan- 
hydrous bond results in protein modification by the 5'-ribonucleotide 
of the donor substrate, whereas in the phosphocholination reaction the 
5'-ribonucleotide is presumably released and the phosphocholine 
group is transferred to the polypeptide chain. 

There are several examples of occasions where post-translational 
modifications introduced by bacterial toxins or effectors—which were 
thought to be the exclusive domain of pathogens—were discovered to 
represent mechanisms used to regulate eukaryotic cell functions. 
Curiously, the inclusion of a phosphocholine moiety in a protein 
structure has been indicated previously by studies examining peptides 
secreted by nematodes and from mammalian cells residing in the 
placenta’. Thus, protein phosphocholination may also be used by 
eukaryotic organisms to modulate cellular functions. 


METHODS SUMMARY 

MS/MS analysis of Rab GTPases was conducted on immunoprecipitated proteins 
that were fractionated by SDS-PAGE, digested with trypsin in the gel, and 
extracted peptides were separated using nano-LC and electrosprayed directly onto 
a linear ion trap mass spectrometer (LTQ Velos, ThermoElectron) for MS and MS/ 
MS analysis. All biochemical assays were conducted using purified proteins as 
described in Methods. The antibody TEPC-15 (Sigma) was used to detect phos- 
phocholinated proteins by immunonoblot analysis. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture and transfection. COS7 and HEK293 cells were grown in Dulbecco’s 
modified Eagle medium (DMEM) from Gibco (Carlsbad) containing 10% heat- 
inactivated fetal bovine serum (FBS; Gibco). Cell lines were cultured at 37 °C in 5% 
CO3. For transfection, COS7 or HEK293 cells were added to 12-mm coverslips in 
24-well plates at a density of 2.5 < 10* cells per well. Cells were transfected with 
0.5 pg of each plasmid. Cells were fixed 18h after transfection in 4% PFA, per- 
meabilized with 0.05% saponin and processed for immunofluorescence micro- 
scopy as described previously". 

Fluorescence microscopy. Digital images were acquired with a Nikon TE300 
microscope using a X100 1.4 N.A objective lens and a Hamamatsu ORCA-ER 
camera controlled by IP Lab software. 

Alkaline phosphatase secretion assay. Secretion assays were performed as 
described”. Briefly, HEK293 cells were co-transfected with a plasmid encoding 
secreted alkaline phosphatase and a plasmid encoding either GFP, GFP-AnkX or 
GFP-AnkX(H229A). After an 18 h incubation period, cells were washed with PBS 
and fresh medium was added. Cells were incubated for 10h and then the alkaline 
phosphatase secretion index was determined by measuring the ratio of alkaline 
phosphatase protein secreted into the culture medium to the total amount of 
alkaline phosphatase protein in the assay well. The Tropix PhosphaLight System 
Kit (Applied BioSystems) was used to measure alkaline phosphatase activity and the 
Tecan Infinite M1000 plate reader with iControl Software was used to detect 
chemiluminescence. Data shown are the mean + s.d. from three independent 
samples for each condition. Results were validated in two independent assays. 
Protein purification and in vitro AMPylation. His-tagged and GST-tagged 
proteins were purified as described previously’. Purified GST-Rab1A (2 1g) was 
incubated with 0.4 1g of purified effector protein in buffer (20 mM HEPES pH 7.4, 
100 mM NaCl, 1 mM MgCl, and 0.1 mM GTPYS) and incubated for 1 h at 30 °C in 
the presence of 2 Ci of **P-c-labelled ATP or **P-y-labelled ATP (Perkin Elmer). 
Labelled proteins were identified by autoradiography following SDS-PAGE. In 
vitro AMPylation results shown are representative of three independent assays. 
Bacterial strains and plasmids. The Legionella strains were grown on charcoal 
yeast extract plates as described previously'*. The parental strain (wild type) was 
L. pneumophila serogroup 1 strain Lp01, and the variant strains were all isogenic 
mutants described previously''*”°”°, with the exception of the AankX AdrrA 
double mutant, which was generated for this study using allelic exchange to 
introduce the AankX mutation into the AdrrA strain as described". For all experi- 
ments, Legionella were isolated from charcoal yeast extract plates after growth for 2 
days at 37°C. The plasmid pEFGPC2 (Clontech) was used for all GFP fusion 
constructs, the plasmid pJB1806 was used to produce DrrA and AnkX in 
Legionella, pQE30 (Qiagen) was used for all His-tagged constructs, and 
pGEX2TK (GE LifeSciences) was used for all GST-tagged constructs. The Rab 
plasmids were constructed using cDNA encoding human RAB1A, RABSA, 
RAB6A, RAB35 and canine RABIA. 

Cell lines and Legionella infection. HEK293 FCyRII cells” were used to create the 
HEK293 FCyRII 3X-Flag-Rab1A stable cell line. This cell line was used for assays 
examining Rab1 modifications during infection of host cells by Legionella. For each 
assay, cells grown to near confluency in two 10-cm dishes were infected with 
opsonized Legionella pneumophila strain LPO1 (wild type) or the isogenic mutants 
at an estimated multiplicity of 100 bacteria to 1 host cell. After incubation for 0.5 h at 
37 °C the cells were lysed in buffer containing 20 mM HEPES pH 7.4, 100 mM NaCl, 
1mM MgCh, 1% Triton X-100, 1mM PMSF and protease inhibitor cocktail 
(Roche). Lysates were centrifuged at 17,900g and the post-nuclear supernatant 
was then incubated with Flag-antibody-coated beads (Sigma) at 4 °C for 1 h, washed 
and the Rab1 protein was eluted using Flag peptide. Samples were then processed for 
MS/MS analysis. This same approach was used to assay modifications to Rab35 after 
infection, except that the HEK293 FcyRII cell line was transfected with an expression 
plasmid producing 3X -Flag-Rab35 and then infected 18 h after transfection. 
Golgi and early endosome disruption assay. The disruption of Golgi and early 
endosomes was assessed in COS7 cells producing GFP—AnkX, GFP-AnkX(H229A), 
GFP-CBU_2078 or GFP alone. Cells were fixed with 4% PFA 18 h after transfection 
and stained with mouse monoclonal antibodies specific for either GM130 or EEA1 
(BD Transduction Laboratories) at a dilution of 1:200. Golgi and endosome mor- 
phology in GFP-positive cells was assessed by fluorescence microscopy. The Golgi 
was completely fragmented in nearly all the cells producing AnkX, whereas no 
significant fragmentation was observed compared to the background of untrans- 
fected cells in cells producing GFP-~AnkX(H229A), GFP—CBU_2078 or GFP alone. 
The early endosome disruption index represents the percentage of cells producing 
the indicated GFP protein that showed an enlarged endosome phenotype as deter- 
mined by EEA1 staining. The data represent the mean + s.d., from three independ- 
ent replicates in which 200 cells were counted for each protein. P values were 
computed using Student’s unpaired t-test. Data shown were validated in two inde- 
pendent experiments. 
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In vitro phosphocholination assay. GST-tagged Rab1A (51g) attached to glu- 
tathione agarose was incubated with thrombin to remove the tag. RablA was 
incubated with 1 j1g of purified effector protein in buffer (20 mM HEPES pH 7.4, 
100 mM NaCl, 1mM MgCh, 1mM ATP) and incubated for 1h at 30°C in the 
presence of 1 mM CDP-choline. Samples were boiled in SDS-loading buffer and the 
Rab1 protein was excised after SDS-PAGE and analysed by LC-MS/MS analysis. 
For the immunoblot analysis, the amounts of AnkX in each reaction varied from 
0 pg to 1.0 ig, and the amount of modified Rab] in each reaction was compared by 
immunoblot analysis using an antibody specific for phosphocholine (TEPC-15; 
Sigma) to detect modified Rab1 and an antibody specific for Rab1A (Santa Cruz 
Biotechnologies) to detect total Rab1. Data shown were validated in three inde- 
pendent experiments. 

Detection of host proteins modified by AnkX and CBU_2048 in vivo. HEK293 
cells in 24-well dishes were transfected with plasmids encoding GFP-CBU_2078, 
GFP-AnkxX or GFP-AnkX(H229A) and cultured for 18 h. Cells were lysed in buffer 
containing 20 mM HEPES pH 7.4, 100 mM NaCl, 1 mM MgCh, 1% Triton X-100, 
1mM PMSF and protease inhibitor cocktail. The lysates were centrifuged at 17,900g 
and 50 pg of the supernatant was separated by SDS-PAGE for immunoblot analysis 
using the anti-phosphocholine-specific antibody TEPC-15 (Sigma). 

GEF binding assays. To measure connecdenn binding to modified Rab35, a plas- 
mid encoding 3X -Flag-Rab35(S22N) was co-transfected into cells together with a 
plasmid encoding GFP-CBU_2078, GFP—AnkX or GFP-AnkX(H229A). The GDP- 
locked allele of Rab35 was used because it demonstrates enhanced connecdenn 
binding. Cellular lysates were prepared as described earlier, and post-nuclear super- 
natant was incubated with 4 1g of purified connecdenn (GenBank accession number 
NP_659414). 3X-Flag-Rab35(S22N) was precipitated using anti-Flag agarose 
(Sigma) and eluted from the beads using the 3X -Flag peptide (Sigma). The eluted 
proteins were resolved by SDS-PAGE gel and proteins were identified after the gel 
was stained with Coomassie brilliant blue dye. The locations of 3X-Flag- 
Rab35(S22N) and connecdenn were determined by running purified connecdenn 
and 3X -Flag-Rab35(S22N) in adjacent wells. To measure DrrA binding to modified 
Rab1, HEK293 cells that stably produce 3X-Flag—-Rab1 were transfected with GFP- 
CBU_2078, GFP-AnkX or GFP-AnkX(H2294A). 500 jig of post-nuclear supernatant 
was incubated with 0.5 tg of purified His-tagged DrrA (340-533). 3X -Flag—Rab1 was 
precipitated using anti-Flag agarose beads (Sigma). Immunoblot analysis was used to 
compare the amount of DrrA (anti-His), the amount of total 3X-Flag-Rab1 (anti- 
Flag, Sigma) and the amount of phosphocholinated Rab1 (TEPC-15; Sigma) after 2% 
of the total amount of 3X-Flag-Rab1 precipitated from each reaction was fractio- 
nated by SDS-PAGE. Data shown were validated in three independent experiments. 
Mass spectrometry. 3X-Flag—Rab proteins were immunoprecipitated from cells 
using anti-Flag agarose beads, and separated by SDS-PAGE. The band correspond- 
ing to the 3X-Flag-Rab protein was excised from the gel and treated with DTT to 
reduce disulphide bonds and then alkylated with iodoacetamide (IAM). Trypsin 
digestion was allowed to occur overnight. Resulting peptides were extracted from gel 
matrix and then resuspended in aqueous buffer before final LC-MS/MS analysis. 
Nanoflow reverse-phase LC separation was carried out on a Proxeon EASY-nLC 
system (Thermo Scientific). The capillary column (75 tm X 150 mm, PICOFRIT, 
New Objective) was packed in-house. A methanol slurry containing 5 um, 100 A 
Magic C18AQ silica-based particles (Microm BioResources) was forced to run 
through an empty capillary (with a frit in the end) using a pressurized device. The 
LC mobile phase was comprised of solvent A (97% HO, 3% acetonitrile (ACN) and 
0.1% formic acid (FA)) and solvent B (100% ACN and 0.1% FA). The nano-LC 
separation was performed with the following gradient: B was increased from 7% to 
35% in 40 min and then raised to 90% in 3 min and kept there for 10 min before 
going back to 100% A for column equilibration. At the moment when peptides were 
eluted from the capillary column, they were electrosprayed directly onto a linear ion 
trap mass spectrometer (LTQ Velos, ThermoElectron) for MS and MS/MS analysis. 
A data-dependent mode was enabled for peptide fragmentation with one full MS 
scan followed by collision induced dissociation (CID) of the ten most intense peptide 
ions. Dynamic exclusion was enabled to preclude repeated analyses of the same 
precursor ion. MS/MS scans were processed and searched using MASCOT 
(Matrix Science). The resulting peptide and protein assignments were filtered to 
keep only those identifications with scores above extensive homology. High-resolu- 
tion MS/MS analysis was performed and the data was acquired by a high-resolution 
mass spectrometer (Orbitrap) by the Keck Proteomic Facility at Yale University. All 
LC-MS/MS data was validated by at least two independent experiments. 
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Atlantic cod (Gadus morhua) is a large, cold-adapted teleost that sus- 
tains long-standing commercial fisheries and incipient aquaculture’”. 
Here we present the genome sequence of Atlantic cod, showing 
evidence for complex thermal adaptations in its haemoglobin gene 
cluster and an unusual immune architecture compared to other 
sequenced vertebrates. The genome assembly was obtained exclu- 
sively by 454 sequencing of shotgun and paired-end libraries, and 
automated annotation identified 22,154 genes. The major histo- 
compatibility complex (MHC) II is a conserved feature of the 
adaptive immune system of jawed vertebrates**, but we show that 
Atlantic cod has lost the genes for MHC I, CD4 and invariant chain 
(Ii) that are essential for the function of this pathway. Nevertheless, 
Atlantic cod is not exceptionally susceptible to disease under 
natural conditions’. We find a highly expanded number of 
MHCI genes and a unique composition of its Toll-like receptor 
(TLR) families. This indicates how the Atlantic cod immune system 
has evolved compensatory mechanisms in both adaptive and innate 
immunity in the absence of MHCII. These observations affect 
fundamental assumptions about the evolution of the adaptive 
immune system and its components in vertebrates. 

We sequenced the genome of a heterozygous male Atlantic cod 
(NEAC_001, Supplementary Notes 1 and 2), applying a whole- 
genome shotgun approach to 40X coverage (estimated genome size 
of 830 megabases (Mb), Supplementary Note 4 and Supplementary 
Fig. 2) using 454 technology (Supplementary Note 3). Two programs 
(Newbler® and Celera’, Supplementary Notes 5 and 6) produced 
assemblies with short contigs, yet with scaffolds of comparable size to 
those of Sanger-sequenced teleost genomes (Supplementary Note 10 and 
Supplementary Fig. 8). Although fragmentation due to short tandem 
repeats is difficult to address (Supplementary Note 7), we resolved 
numerous gaps attributable to heterozygosity (Supplementary Note 8). 
The assemblies differ in scaffold and contig length (Table 1), although 
their scaffolds align to a large extent (Supplementary Note 9 and Sup- 
plementary Fig. 7). We obtained about one million single nucleotide 
polymorphisms (SNPs) by mapping 454 and Illumina reads from the 
sequenced individual to the Newbler assembly (Supplementary Note 11). 
Both assemblies cover more than 98% of the reads from an extensive 
transcriptome data set, indicating that the proteome is well represented 
(Supplementary Note 13). The assemblies are consistent with four 


independently assembled bacterial artificial chromosome (BAC) insert 
clones (Supplementary Note 14 and Supplementary Fig. 9), and with 
the expected insert size of paired BAC-end reads (Supplementary Note 
15 and Supplementary Fig. 10). 

A standard annotation approach based on protein evidence was 
complemented by a whole-genome alignment of the Atlantic cod with 
the stickleback (Gasterosteus aculeatus), after repeat-masking 25.4% of 
the Newbler assembly (Supplementary Note 16 and Supplementary 
Table 6). In this way, 17,920 out of 20,787 protein-coding stickleback 
genes were mapped onto reorganized scaffolds (Supplementary Note 17). 
Additional protein-coding genes, pseudogenes and non-coding RNAs 
were annotated using the standard Ensembl pipeline. These approaches 
resulted in a final gene set of 22,154 genes (Supplementary Table 7). 
Comparative analysis of gene ontology classes indicates that the major 
functional pathways are represented in the annotated gene set 
(Supplementary Note 18 and Supplementary Fig. 11). We anchored 
332 Mb of the Newbler assembly to 23 linkage groups of an existing 
Atlantic cod linkage map using 924 SNPs* (Supplementary Note 19 and 
Supplementary Table 8). These linkage groups have distinct orthology 
to chromosomes of other teleosts, on the basis of the number of co- 
occurring genes, showing that the whole-genome shotgun assembly 
reflects the expected chromosomal ancestry (Fig. 1, Supplementary 
Note 20 and Supplementary Table 9). 


Table 1 | Assembly statistics 


Number — Bases (Mb) N50L (bp)* N50 (n)+ ML (bp)t 
Newbler 
Contigss 284,239 536 2,778 50,237 76,504 
Scaffolds 6,467 611 687,709 218 4,999,318 
Entire assembly || 157,887 753 459,495 344 4,999,318 
Celera 
Contigss 135,024 555 7,128 19,938 117,463 
Scaffolds 3,832 608 488,312 373 2,810,583 
Entire assembly || 17,039 629 469,840 395 2,810,583 


*Minimum sequence length in which half of the assembled bases occur. 
+Number of sequences with lengths of N5OL or longer. 

¢Maximum length. 

8Contigs longer than 500 bp. 

\|Scaffolds and unplaced contigs. 
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Figure 1 | Synteny between Atlantic cod and selected teleosts. The co- 
occurrence of orthologous genes (with a minimum of 50% sequence identity 
over 50% of the alignment, sphere size indicates the numbers of syntenic genes) 
in 23 Atlantic cod linkage groups® (x-axis) reveals synteny with the 
chromosomes of four teleosts (y-axis). Several genes located on the stickleback 
chromosome XIV, tetraodon chromosome 4 and medaka chromosome 12 
indicate a lineage-specific chromosomal rearrangement in Atlantic cod. 


Well-studied haemoglobin polymorphisms in Atlantic cod are indi- 
cative of functional molecular adaptation to thermal variation’”*. The 
genome contains nine «- and B-globin genes that are organized in two 
unlinked clusters, 65-«1-$1-a4 and [3-$4-u2-03-[2 (refs 13, 14). 
We discovered an indel polymorphism of 73 base pairs (bp) in the 
intergenic promoter region of the ~1-f1 globin pair (Fig. 2a and 
Supplementary Note 21). This promoter polymorphism occurs in 
highly significant linkage disequilibrium with two known polymorphic 
sites in the [1 gene, the Val55Met and Ala62Lys substitutions’, in eight 
Atlantic cod populations (Supplementary Note 22 and Supplementary 
Fig. 12). In fact, in the three most northern Atlantic populations and in 
both Baltic populations, the cod f1-globin gene predominantly occurs 
as a single homozygous genotype consisting of the long promoter and 
the Val 55-Ala 62 allele (Supplementary Table 10). By placing the two 
promoter variants in front ofa luciferase reporter gene and transfecting 
the constructs into salmon kidney cells (Supplementary Note 23), we 
found that temperature and promoter type have a significant inter- 
action effect (generalized linear model, F,, 35 = 7.85, P = 0.007, Fig. 2b) 
and that the long promoter has twofold higher transcriptional activity 
compared to the short promoter at 15 °C and 20°C. Increased globin 
synthesis of the Val 55-Ala 62 allele would compensate for its lower 
oxygen affinity'®” at high temperatures. Thus, the promoter poly- 
morphism provides a molecular compensatory mechanism that helps 
to maintain the total oxygen-carrying capacity'’. The tight linkage 
between the two types of polymorphism provides a compelling 
example of the coevolution of structural and regulatory adaptation, 
and highlights the relationship between temperature and functional 
molecular variation in the haemoglobin system'"®. 

The Atlantic cod immune system has unusual properties that set it 
apart from that of other teleosts: high levels of IgM’’, a minimal antibody 
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Figure 2 | Functional haemoglobin polymorphisms in Atlantic cod. 

a, Schematic of the head-to-head organized «1 and £1 globin genes, the 
intergenic promoter region and transcription start sites (red arrows). A 
promoter polymorphism consisting of a 73-bp indel (red box) segregates in 
linkage disequilibrium with two amino-acid-substitution polymorphisms 
(vertical lines) at positions 55 and 62 in B1 globin that affect its oxygen-binding 
affinity. This linkage disequilibrium results in two predominant haplotypes, 
long-Val-Ala and short-Met-Lys. b, Normalized luciferase luminescence ratios 
in salmon kidney cells. Cells were transfected using the long promoter (black 
circles) or the short promoter (white circles) and incubated at 4°C, 15 °C or 
20 °C (n = 3 for each treatment level). Error bars show 95% confidence intervals. 
response after pathogen exposure”’”’* and abundant phagocytic neu- 
trophils in the peripheral blood'*”°. Despite speculation, the exact causes 
for these differences remain unknown’. We found that most genes 
involved in the vertebrate immune response are present in Atlantic 
cod (Supplementary Note 24, Supplementary Fig. 13 and Supplemen- 
tary Table 11). Nevertheless, we did not find genes for the MHCII 
isoforms, their assembly and trafficking chaperone Ii*' and the 
MHC lIl-interacting protein CD4, which is essential for helper T-cell 
activation. By comparing a comprehensive set of vertebrate MHC II, 
CD4 and Ii sequences to the genome assemblies and all unassembled 
454 and Illumina sequencing reads (a data set of about 49.5 gigabases), 
we detected a truncated pseudogene for CD4 (Supplementary Note 25), 
which is located in a region of conserved synteny (Supplementary Note 
27 and Supplementary Fig. 18). No traces of MHCII and Ii were found 
in syntenous regions (Supplementary Note 27 and Supplementary Figs 
16, 17, 19 and 20) and quantitative PCR (qPCR) targeting a conserved 
domain in MHC II did not amplify the target sequence (Supplementary 
Note 26 and Supplementary Fig. 15). The absence of MHC II and li, and 
the pseudogenic nature of CD4, show that Atlantic cod has lost the 
function of the classical pathway for adaptive immunity against bacterial 
and parasitic infections. Nevertheless, Atlantic cod deals adequately 
with its prevailing pathogen load in its natural ecological settings’. 
Previous transcriptional (complementary DNA) studies in Atlantic 
cod have indicated an expansion of the number of MHCI loci”. By 
targeting the conserved MHCI «3 domain in genomic DNA using 
qPCR, we quantified more accurately the number of loci belonging to 
the teleost U-lineage”* (Supplementary Note 28). Notably, Atlantic cod 
has about 100 classical MHCI loci, which is a highly expanded number 
compared to other teleosts (Fig. 3a). A phylogenetic analysis of teleost 
MHCI sequences supports the existence of two clades in cod (Fig. 3b 
and Supplementary Note 29). Within each clade, the mutation patterns 
show statistically significant signs of positive selection that are indicative 
of subfunctionalization. These findings indicate that loss of MHCII 
functionality has coincided with a more versatile usage of the cytosolic 
pathway of MHCI. Two different MHCI antigen-presentation path- 
ways—the classical pathway and the alternative cross-presentation 
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Figure 3 | MHCI diversity in Atlantic cod. a, Copy-number estimates of the 
MHCI «3 domain. Estimates are based on qPCR ratios (see Supplementary 
Note 28) of the MHCI «3 domain and a single-copy reference gene. For 
Atlantic cod, B2-microglobulin and topoisomerase III- (*) were used as 
reference genes; for human and stickleback, §2-microglobulin was used. The 
estimates for human and stickleback agree with the expected number of «3 
domains found in both reference genomes (Supplementary Table 15). Black 
dots indicate 95% confidence intervals calculated by bootstrapping 

(n = 50,000). b, Phylogeny of amino-acid sequences of MHCI «1-«3 domains 
in teleosts. The Atlantic cod sequences are derived from cDNA and comprise 
classical U-lineage MHCI only. The other teleost sequences were obtained 
from Ensembl and NCBI, and contain classical and non-classical U-lineage 
MHCI. Alignments were visually inspected and corrected where necessary. 
Maximum likelihood (ML) values and Bayesian posterior probabilities (dots) 
support the main branches on the ML topology. Distance represents the 
number of substitutions per site (scale bar). The ratio of non-synonymous to 
synonymous variable sites (Ka/Ks), the average nucleotide diversity per site (1) 
and Tajima’s D (D) were calculated for the two main clades in Atlantic cod. 


pathway—can initiate immune responses in mammals”. The cross- 
presentation pathway represents a structural and cellular modification 
of the MHCI machinery that allows activation of CD8* T cells upon 
bacterial infection. The cytokine gene profile of Atlantic cod (Sup- 
plementary Table 11) supports the possibility of generating different 
subsets of CD8* T cells that either provide direct protection or regulate 
other immune cells, and thus compensate for the loss of CD4~ T cells. 

In addition to the MHCI expansion, we found an unusual composi- 
tion of the highly conserved TLR families that have a fundamental role 
in the innate immune response and the initial detection of pathogens. 
Teleost TLR-encoding genes occur in well-supported phylogenetic 
clusters, most of which share functional properties with mammalian 
orthologues, although some are fish-specific’®. The Atlantic cod TLR 
genes form monophyletic groups within the known teleost functional 
groups (Fig. 4, Supplementary Note 30 and Supplementary Fig. 22). 
Genes for several TLRs that recognize bacterial surface antigens (TLR1, 
TLR2 and TLRS) are, however, absent, leaving only the teleost-specific 
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Figure 4 | Phylogeny of TLR families in Atlantic cod. TLR protein sequences 
were selected on the basis of the conserved Toll-IL-1 receptor (TIR) domain for 
Atlantic cod, including known sequences from stickleback, zebrafish, 
tetraodon, fugu, medaka and human as references. TLR clades with (*) or 
without (—) Atlantic cod sequences are denoted according to human or teleost 
orthologues (summary tree topology, top left panel). Distance represents the 
average number of substitutions per site (scale bar). ML values and Bayesian 
posterior probabilities greater than 75/0.75 support the ML topology. Detailed 
topologies of TLR7 (blue), TLR8 (purple), TLR9 (green) and TLR22 (grey) 
show gene expansions for Atlantic cod (red). Multiple TLR copies within 
species are subdivided by letters, and follow Ensembl nomenclature for D. rerio. 


TLR14 and TLR18 as members of the TLR1 family in Atlantic cod. 
Moreover, several families of TLRs that recognize nucleic acids (TLR7, 
TLR8, TLR9 and TLR22) have markedly expanded, resulting in the 
highest number of TLRs found in a teleost so far. This TLR repertoire 
indicates that the Atlantic cod immune system relies relatively heavily 
on nucleic-acid-detecting TLRs to recognize bacterial pathogens. 
Notably, the gene expansion of TLR9 coincides with an expansion of 
interleukin-8 genes (IL-8, Supplementary Table 11). IL-8 is an import- 
ant chemokine in the innate immune response and is directly induced 
by TLR9 in human neutrophils”. The corresponding expansions of 
IL-8 and TLR9 indicate that this signalling cascade is particularly 
important in Atlantic cod. 

The loss of MHCII function and lack of a CD4* T-cell response 
represent a fundamental change in how the adaptive immune system is 
initiated and regulated in Atlantic cod. The marked expansion of 
MHCI genes and unusual TLR composition signify a shift of its 
immune system in handling microbial pathogens. An expanded 
MHCTI repertoire in the presence of a non-polymorphic MHCII is 
found in an evolutionarily-distant vertebrate, the axolotl (Ambystoma 
mexicanum)*”°, These observations indicate that anomalous immune 
systems (possibly analogous to that of Atlantic cod) have evolved 
independently. Additionally, we did not recover evidence for 
expressed MHCII, CD4 and Ii in the transcriptomes of three other 
gadoids, indicating that the unusual immune system is a derived char- 
acteristic of the gadoid lineage (Supplementary Tables 18 and 19). 

We have provided the first annotated genome of a species that 
supports extensive fisheries and is on the verge of becoming an import- 
ant aquaculture species. This work provides a major foundation for 
addressing key issues related to the management of natural Atlantic 
cod populations, such as the concept of fisheries-induced evolution, 
which dictates that selective harvesting can change the evolutionary 
trajectory of major life-history traits of natural populations”. 
Moreover, our novel findings regarding the immune system will allow 
for more targeted vaccine development, aiding disease management 
and the process of domestication of Atlantic cod. These findings 
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change fundamental assumptions regarding the evolution of the ver- 
tebrate immune system. 


METHODS SUMMARY 


Detailed methods on the sequencing and assembly of data from genomic and 
transcriptomic origins; annotation, synteny analyses, transfection experiments, 
bioinformatic analyses and phylogenetic analyses presented in this manuscript 
are described in the Supplementary Information. 
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